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NOVEL HUMAN GENES AND GENE EXPRESSION PRODUCTS I 

Cross-References to Related Applications 

This application is a continuation-in-part of U.S. provisional patent application serial 
no. 60/068,755, filed December 23, 1997, and of U.S. provisional patent application serial 
no. 60/080,664, filed April 3, 1998, and of U.S. provisional patent application serial no. 
60/105,234, filed October 21, 1998, each of which applications are incorporated herein by 
reference. 

Field of the Invention 

The present invention relates to novel polynucleotides, particularly to novel 
polynucleotides of human origin that are expressed in a selected cell type, are differentially 
expressed in one cell type relative to another cell type (e.g., in cancerous cells, or in cells of: 
specific tissue origin) and/or share homology to polynucleotides encoding a gene product 
having an identified functional domain and/or activity. 

Background of the Invention 

Identification of novel polynucleotides, particularly those that encode an expressed 
gene product, is important in the advancement of drug discovery, diagnostic technologies, 
and the understanding of the progression and nature of complex diseases such as cancer. 
Identification of genes expressed in different cell types isolated from sources that differ in 
disease state or stage, developmental stage, exposure to various environmental factors, the 
tissue of origin, the species from which the tissue was isolated, and the like is key to 
identifying the genetic factors that are responsible for the phenotypes associated with these 
various differences 

This invention provides novel human polynucleotides, the polypeptides encoded by 
these polynucleotides, and the genes and proteins corresponding to these novel 
polynucleotides. 

Summary of the Invention 

This invention relates to novel human polynucleotides and variants thereof, their 
encoded polypeptides and variants thereof, to genes corresponding to these polynucleotides 

1 
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and to proteins expressed by the genes. The invention also relates to diagnostic and 

therapeutic agents employing such novel human polynucleotides, their corresponding genes 
or gene products, e.g., these genes and proteins, including probes, antisense constructs, and 
antibodies. 

Accordingly, in one embodiment, the present invention features a library of 
polynucleotides, the library comprising the sequence information of at least one of SEQ ID 
NOS: 1 -844. In related aspects, the invention features a library provided on a nucleic acid 
array, or in a computer-readable format. 

In one embodiment, the library is comprises a differentially expressed polynucleotide 
comprising a sequence selected from the group consisting of SEQ ID NOS:9, 39, 42, 52, 62, 
74, 1 19, 172, 317, and 379. In specific related embodiments, the librarv 
polynucleotide that is differentially expressed in a human breast cancer cell, where the 
polynucleotide comprises a sequence selected from the group consisting of SEQ ID NOS: 4, 
9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 114, 123, 144, 130, 157, 162, 172, 178, 183, 202, 214, 
219, 223, 258, 298, 317, 338, 379, 384, 386, and 388; 2) a polynucleotide differentially 
expressed in a human colon cancer cell, where the polynucleotide comprises a sequence 
selected from the group consisting of SEQ ID NOS: 1, 39, 52, 97, 1 19, 134, 172, 176, 241, 
288, 317, 357, 362, and 374; or 3) a polynucleotide differentially expressed in a human lung 
cancer cell, where the polynucleotide comprises a sequence selected from the group 
consisting of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 1 19, 135, 154, 160, 260, 308, 323, 349, 
361, 369, 371, 379, 395, 381, and 400. 

In another aspect, the invention features an isolated polynucleotide 
comprising a nucleotide sequence having at least 90% sequence identity to an identifying 
sequence of SEQ ID NOS: 1 -844 or a degenerate variant thereof. In related aspects, the 
invention features recombinant host cells and vectors comprising the polynucleotides of the 
invention, as well as isolated polypeptides encoded by the polynucleotides of the invention 
and antibodies that specifically bind such polypeptides. 

In one embodiment, the invention features an isolated polynucleotide comprising a 
sequence encoding a polypeptide of a protein family selected from the group consisting of: 
4 transmembrane segments integral membrane proteins, 7 transmembrane receptors, 
ATPases associated with various cellular activities (AAA), eukaryotic aspartyl proteases, 
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GATA family of transcription factors, G-protein alpha subunit, phorbol esters/diacylglycerol 

binding proteins, protein kinase, protein phosphatase 2C, protein tyrosine phosphatase, 

trypsin, wnt family of developmental signaling proteins, and WW/rsp5/WWP domain 

containing proteins. In a specific related embodiment, the invention features a 

5 polynucleotide comprising a sequence of one of SEQ ID NOS: 24, 41, 101, 157, 291, 305, 

315, 341, 63, 116, 134, 136, 151, 384, 404, 308, 213, 367, 188, 251, 202, 315, 367, 397, 256, 

382, 169, 23, 291, 324, 330, 341, 353, 188, 379 , and 395. 

In another embodiment, the invention features a polynucleotide comprising a 

sequence encoding a polypeptide having a functional domain selected from the group 

10 consisting of: Ank repeat, basic region plus leucine zipper transcription factors, 

bromodomain, EF-hand, SH3 domain, WD domain/G-beta repeats, zinc finger (C2H2 type), 
zinc finger (CCHC class), and zinc-binding metalloprotease domain. In a specific related 
embodiment, the invention features a polynucleotide comprising a sequence of one of SEQ 
ID NOS: 116, 251, 374, 97, 136, 242, 379, 306, 386, 18, 335, 61, 306, 386, 322, 306, and 

15 395. 

In another aspect, the invention features a method of detecting differentially 
expressed genes correlated with a cancerous state of a mammalian cell, where the method 
comprises the step of detecting at least one differentially expressed gene product in a test 
sample derived from a cell suspected of being cancerous, where the gene product is encoded 

20 by a gene corresponding to a sequence of at least one of SEQ ID NOS:4, 9, 39, 42, 52, 62, 

65, 66, 68, 74, 81, 1 14, 123, 144, 130, 157, 162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 
317, 338, 379, 384, 386, 388, 1, 39, 52, 97, 119, 134, 172, 176, 241, 288, 317, 357, 362, 374, 
9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 361, 369, 371, 379, 395, 381, 
and 400. Detection of the differentially expressed gene product is correlated with a 

25 cancerous state of the cell from which the test sample was derived. In one embodiment, the 
detecting is by hybridization of the test sample to a reference array, wherein the reference 
array comprises an identifying sequence of at least one of SEQ ID NOS:l-844. 

In one embodiment of the method of the invention, the cell is a breast tissue derived 
cell, and the differentially expressed gene product is encoded by a gene corresponding to a 

30 sequence of at least one of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 1 14, 123, 
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144, 130, 157, 162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338, 379, 384, 386, 

and 388. 

In another embodiment of the method of the invention, the cell is a colon tissue 
derived cell, and differentially expressed gene product is encoded by a gene corresponding to 
5 a sequence of at least one of SEQ ID NOS: 1, 39, 52, 97, 1 19, 134, 172, 176, 241, 288, 317, 
357, 362, and 374. 

In yet another embodiment of the method of the invention, the cell is a lung tissue 
derived cell, and differentially expressed gene product is encoded by a gene corresponding to 
a sequence of at least one of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 
10 308, 323, 349, 361, 369, 371, 379, 395, 381, and 400. 

Other aspects and embodiments of the invention will be readily apparent to the 
ordinarily skilled artisan upon reading the description provided herein. 

Detailed Descri ption of the Invention 
1 5 The invention relates to polynucleotides comprising the disclosed nucleotide 

sequences, to full length cDNA, mRNA and genes corresponding to these sequences, and to 
polypeptides and proteins encoded by these polynucleotides and genes. 

Also included are polynucleotides that encode polypeptides and proteins encoded by 
the polynucleotides of the Sequence Listing. The various polynucleotides that can encode 
20 these polypeptides and proteins differ because of the degeneracy of the genetic code, in that 
most amino acids are encoded by more than one triplet codon. The identity of such codons 
is well-known in this art, and this information can be used for the construction of the 
polynucleotides within the scope of the invention. 

Polynucleotides encoding polypeptides and proteins that are variants of the 
25 polypeptides and proteins encoded by the polynucleotides and related cDNA and genes are 
also within the scope of the invention. The variants differ from wild type protein in having 
one or more amino acid substitutions that either enhance, add, or diminish a biological 
activity of the wild type protein. Once the amino acid change is selected, a polynucleotide 
encoding that variant is constructed according to the invention. 
30 The following detailed description describes the polynucleotide compositions 

encompassed by the invention, methods for obtaining cDNA or genomic DNA encoding a 
full-length gene product, expression of these polynucleotides and genes, identification of 

4 
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structural motifs of the polynucleotides and genes, identification of the function ot a gene 
product encoded by a gene corresponding to a polynucleotide of the invention, use of the 
provided polynucleotides as probes and in mapping and in tissue profiling, use of the 
corresponding polypeptides and other gene products to raise antibodies, and use of the 
5 polynucleotides and their encoded gene products for therapeutic and diagnostic purposes. 

I. Polynucleotide Compositions 

The scope of the invention with respect to polynucleotide compositions includes, but is not 
necessarily limited to, polynucleotides having a sequence set forth in any one of SEQ ID 
10 NOS:l-844; polynucleotides obtained from the biological materials described herein or other 
biological sources (particularly human sources) by hybridization under stringent conditions 
(particularly conditions of high stringency); genes corresponding to the provided 
polynucleotides; variants of the provided polynucleotides and their corresponding genes, 
particularly those variants that retain a biological activity of the encoded gene product (e.g. , 
1 5 a biological activity ascribed to a gene product corresponding to the provided 

polynucleotides as a result of the assignment of the gene product to a protein family(ies) 
and/or identification of a functional domain present in the gene product). Other nucleic acid 
compositions contemplated by and within the scope of the present invention will be readily 
apparent to one of ordinary skill in the art when provided with the disclosure here. 
20 The invention features polynucleotides that are expressed in cells of human tissue, 

specifically human colon, breast, and/or lung tissue. Novel nucleic acid compositions of the 
invention of particular interest comprise a sequence set forth in any one of SEQ ID NOS: 1 - 
844 or an identifying sequence thereof. An "identifying sequence" is a contiguous sequence 
of residues at least about 10 nt to about 20 nt in length, usually at least about 50 nt to about 
25 1 00 nt in length, that uniquely identifies a polynucleotide sequence, e.g. , exhibits less than 
90%, usually less than about 80% to about 85% sequence identity to any contiguous 
nucleotide sequence of more than about 20 nt. Thus, the subject novel nucleic acid 
compositions include full length cDNAs or mRNAs that encompass an identifying sequence 
of contiguous nucleotides from any one of SEQ ID NOS: 1-844. 
30 The polynucleotides of the invention also include polynucleotides having sequence 

similarity or sequence identity. Nucleic acids having sequence similarity are detected by 
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hybridization under low stringency conditions, for example, at 50°C and 10XSSC (0.9 M 

saline/0.09 M sodium citrate) and remain bound when subjected to washing at 55°C in 
1XSSC. Sequence identity can be determined by hybridization under stringent conditions, 
for example, at 50°C or higher and 0. 1XSSC (9 mM saline/0.9 mM sodium citrate). 
Hybridization methods and conditions are well known in the art, see, e.g., U.S. Patent No. 
5,707,829. Nucleic acids that are substantially identical to the provided polynucleotide 
sequences, e.g. allelic variants, genetically altered versions of the gene, etc., bind to the 
provided polynucleotide sequences (SEQ ID NOS: 1-844) under stringent hybridization 
conditions. By using probes, particularly labeled probes of DNA sequences, one can isolate 
homologous or related genes. The source of homologous genes can be any species, e.g. 
primate species, particularly human; rodents, such as rats and mice, canines, felines, bovines, 
ovines, equines, yeast, nematodes, etc. 

Preferably, hybridization is performed using at least 15 contiguous nucleotides of at 
least one of SEQ ID NOS: 1-844. That is, when at least 15 contiguous nucleotides of one of 
the disclosed SEQ ID NOs. is used as a probe, the probe will preferentially hybridize with a 
gene or mRNA (of the biological material) comprising the complementary sequence, 
allowing the identification and retrieval of the nucleic acids of the biological material that 
uniquely hybridize to the selected probe. Probes from more than one SEQ ID NO. will 
hybridize with the same gene or mRNA if the cDNA from which they were derived 
corresponds to one mRNA. Probes of more than 15 nucleotides can be used, but 15 
nucleotides represents enough sequence for unique identification. 

The polynucleotides of the invention also include naturally occurring variants of the 
nucleotide sequences (e.g., degenerate variants, allelic variants, etc.). Variants of the 
polynucleotides of the invention are identified by hybridization of putative variants with 
nucleotide sequences disclosed herein, preferably by hybridization under stringent conditions 
For example, by using appropriate wash conditions, variants of the polynucleotides of the 
invention can be identified where the allelic variant exhibits at most about 25-30% base pair 
mismatches relative to the selected polynucleotide probe. In general, allelic variants contain 
15-25% base pair mismatches, and can contain as little as even 5-15%, or 2-5%, or 1-2% 
base pair mismatches, as well as a single base-pair mismatch. 
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The invention also encompasses homologs corresponding to the polynucleotides ot 
SEQ ID NOS: 1-844, where the source of homologous genes can be any mammalian species, 
e.g., primate species, particularly human; rodents, such as rats, canines, felines, bovines, 
ovines, equines, yeast, nematodes, etc. Between mammalian species, e.g., human and 

5 mouse, homologs have substantial sequence similarity, e.g., at least 75% sequence identity, 
usually at least 90%, more usually at least 95% between nucleotide sequences. Sequence 
similarity is calculated based on a reference sequence, which may be a subset of a larger 
sequence, such as a conserved motif, coding region, flanking region, etc. A reference 
sequence will usually be at least about 18 contiguous nt long, more usually at least about 30 

1 0 nt long, and may extend to the complete sequence that is being compared. Algorithms for 
sequence analysis are known in the art, such as BLAST, described in Altschul et ah, J. Mol. 

Biol. (1990) 275:403-10. 

In general, variants of the invention have a sequence identity greater than at least 
about 65%, preferably at least about 75%, more preferably at least about 85%, and can be 
1 5 greater than at least about 90% or more as determined by the Smith-Waterman homology 
search algorithm as implemented in MPSRCH program (Oxford Molecular). For the 
purposes of this invention, a preferred method of calculating percent identity is the Smith- 
Waterman algorithm, using the following. Global DNA sequence identity must be greater 
than 65% as determined by the Smith-Waterman homology search algorithm as implemented 
20 in MPSRCH program (Oxford Molecular) using an affine gap search with the following 
search parameters: gap open penalty, 12; and gap extension penalty, 1 . 

The subject nucleic acids can be cDNAs or genomic DNAs, as well as fragments 
thereof, particularly fragments that encode a biologically active gene product and/or are 
useful in the methods disclosed herein {e.g., in diagnosis, as a unique identifier of a 
25 differentially expressed gene of interest, etc.). The term "cDNA" as used herein is intended 
to include all nucleic acids that share the arrangement of sequence elements found in native 
mature mRNA species, where sequence elements are exons and 3 and 5 non-coding 
regions. Normally mRNA species have contiguous exons, with the intervening introns, 
when present, being removed by nuclear RNA splicing, to create a continuous open reading 
30 frame encoding a polypeptide of the invention. 
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A genomic sequence of interest comprises the nucleic acid present between the 

initiation codon and the stop codon, as defined in the listed sequences, including all of the 
introns that are normally present in a native chromosome. It can further include the 3 and 5 
untranslated regions found in the mature mRNA. It can further include specific 
transcriptional and translation^ regulatory sequences, such as promoters, enhancers, etc., 
including about 1 kb, but possibly more, of flanking genomic DNA at either the 5 and 3 ' end 
of the transcribed region. The genomic DNA can be isolated as a fragment of 1 00 kbp or 
smaller; and substantially free of flanking chromosomal sequence. The genomic DNA 
flanking the coding region, either 3 and 5 , or internal regulatory sequences as sometimes 
found in introns, contains sequences required for proper tissue, stage-specific, or disease- 
state specific expression. 

The nucleic acid compositions of the subject invention can encode all or a part of the 
subject differentially expressed polypeptides. Double or single stranded fragments can be 
obtained from the DNA sequence by chemically synthesizing oligonucleotides in accordance 
with conventional methods, by restriction enzyme digestion, by PCR amplification, etc. 
Isolated polynucleotides and polynucleotide fragments of the invention comprise at least 
about 10, about 15, about 20, about 35, about 50, about 100, about 150 to about 200, about 
250 to about 300, or about 350 contiguous nucleotides selected from the polynucleotide 
sequences as shown in SEQ ID NOS.1-844. For the most part, fragments will be of at least 
1 5 nt, usually at least 1 8 nt or 25 nt, and up to at least about 50 contiguous nt in length or 
more. In a preferred embodiment, the polynucleotide molecules comprise a contiguous 
sequence of at least twelve nucleotides selected from the group consisting of the 
polynucleotides shown in SEQ ID NOS: 1-844. 

Probes specific to the polynucleotides of the invention can be generated using the 
polynucleotide sequences disclosed in SEQ ID NOS:l-844. The probes are preferably at 
least about 12, 15, 16, 18, 20, 22, 24, or 25 nucleotide fragment of a corresponding 
contiguous sequence of SEQ ID NOS:l-844, and can be less than 2, 1, 0.5, 0.1, or 0.05 kb in 
length. The probes can be synthesized chemically or can be generated from longer 
polynucleotides using restriction enzymes. The probes can be labeled, for example, with a 
radioactive, biotinylated, or fluorescent tag. Preferably, probes are designed based upon an 
identifying sequence of a polynucleotide of one of SEQ ID NOS: 1 -844. More preferably, 
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probes are designed based on a contiguous sequence of one of the subject polynucleotides 

that remain unmasked following application of a masking program for masking low 

complexity (e.g., XBLAST) to the sequence., i.e., one would select an unmasked region, as 

indicated by the polynucleotides outside the poly-n stretches of the masked sequence 

5 produced by the masking program. 

The polynucleotides of the subject invention are isolated and obtained in substantial 

purity, generally as other than an intact chromosome. Usually, the polynucleotides, either as 

DNA or RNA, will be obtained substantially free of other naturally-occurring nucleic acid 

sequences, generally being at least about 50%, usually at least about 90% pure and are 

10 typically "recombinant", e.g., flanked by one or more nucleotides with which it is not 
normally associated on a naturally occurring chromosome. 

The polynucleotides of the invention can be provided as a linear molecule or within a 
circular molecule. They can be provided within autonomously replicating molecules 
(vectors) or within molecules without replication sequences. They can be regulated by their 

1 5 own or by other regulatory sequences, as is known in the art. The polynucleotides of the 
invention can be introduced into suitable host cells using a variety of techniques which are 
available in the art, such as transferrin polycation-mediated DNA transfer, transfection with 
naked or encapsulated nucleic acids, liposome-mediated DNA transfer, intracellular 
transportation of DNA-coated latex beads, protoplast fusion, viral infection, electroporation, 

20 gene gun, calcium phosphate-mediated transfection, and the like. 

The subject nucleic acid compositions can be used to, for example, produce 
polypeptides, as probes for the detection of mRNA of the invention in biological samples 
(e.g., extracts of human cells) to generate additional copies of the polynucleotides, to 
generate ribozymes or antisense oligonucleotides, and as single stranded DNA probes or as 

25 triple-strand forming oligonucleotides. The probes described herein can be used to, for 
example, determine the presence or absence of the polynucleotide sequences as shown in 
SEQ ID NOS: 1-844 or variants thereof in a sample. These and other uses are described in 
more detail below. 
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Use of Pol ynucleotides to Obtain Full-Leneth cDNA and Ful1-T.ength Human Gene anH 

Promoter Region 

Full-length cDNA molecules comprising the disclosed polynucleotides are obtained 
as follows. A polynucleotide having a sequence of one of SEQ ID NOS: 1-844, or a portion 
5 thereof comprising at least 1 2, 1 5, 1 8, or 20 nucleotides, is used as a hybridization probe to 
detect hybridizing members of a cDNA library using probe design methods, cloning 
methods, and clone selection techniques such as those described in U.S. Patent No. 
5,654,173. Libraries of cDNA are made from selected tissues, such as normal or tumor 
tissue, or from tissues of a mammal treated with, for example, a pharmaceutical agent. 
1 0 Preferably, the tissue is the same as the tissue from which the polynucleotides of the 
invention were isolated, as both the polynucleotides described herein and the cDNA 
represent expressed genes. Most preferably, the cDNA library is made from the biological 
material described herein in the Examples. Alternatively, many cDNA libraries are available 
commercially. (Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed, (1989) 
1 5 Cold Spring Harbor Press, Cold Spring Harbor, NY). The choice of cell type for library 

construction can be made after the identity of the protein encoded by the gene corresponding 
to the polynucleotide of the invention is known. This will indicate which tissue and cell 
types are likely to express the related gene, and thus represent a suitable source for the 
mRNA for generating the cDNA. Where the provided polynucleotides are isolated from 
20 cDNA libraries, the libraries are prepared from mRNA of human colon cells, more 

preferably, human colon cancer cells, even more preferably, from a highly metastatic colon 
cell, Kml2L4-A. 

Techniques for producing and probing nucleic acid sequence libraries are described, 
for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed„(l 989) 
25 Cold Spring Harbor Press, Cold Spring Harbor, NY. The cDNA can be prepared by using 
primers based on sequence from SEQ ID NOS: 1-844. In one embodiment, the cDNA library 
can be made from only poly-adenylated mRNA. Thus, poly-T primers can be used to 
prepare cDNA from the mRNA. 

Members of the library that are larger than the provided polynucleotides, and 
preferably that encompass the complete coding sequence of the native message, are obtained. 
In order to confirm that the entire cDNA has been obtained, RNA protection experiments 
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are performed as follows. Hybridization of a full-length cDNA to an mRNA will protect the 
RNA from RNase degradation. If the cDNA is not full length, then the portions of the 
mRNA that are not hybridized will be subject to RNase degradation. This is assayed, as is 
known in the art, by changes in electrophoretic mobility on polyacrylamide gels, or by 
detection of released monoribonucleotides. Sambrook et al, Molecular Cloning: A 
Laboratory Manual, 2nd Ed, (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY. In 
order to obtain additional sequences 5' to the end of a partial cDNA, 5' RACE (PCR 
Protocols: A Guide to Methods and Applications, (1990) Academic Press, Inc.) is performed. 

Genomic DNA is isolated using the provided polynucleotides in a manner similar to 
the isolation of full-length cDNAs. Briefly, the provided polynucleotides, or portions 
thereof, are used as probes to libraries of genomic DNA. Preferably, the library is obtained 
from the cell type that was used to generate the polynucleotides of the invention, but this is 
not essential. Most preferably, the genomic DNA is obtained from the biological material 
described herein in the Examples. Such libraries can be in vectors suitable for carrying large 
segments of a genome, such as PI or YAC, as described in detail in Sambrook et al, 9.4- 
9.30. In addition, genomic sequences can be isolated from human BAC libraries, which are 
commercially available from Research Genetics, Inc., Huntville, Alabama, USA, for 
example. In order to obtain additional 5' or 3' sequences, chromosome walking is performed, 
as described in Sambrook et al, such that adjacent and overlapping fragments of genomic 
DNA are isolated. These are mapped and pieced together, as is known in the art, using 
restriction digestion enzymes and DNA ligase. 

Using the polynucleotide sequences of the invention, corresponding full-length genes 
can be isolated using both classical and PCR methods to construct and probe cDNA libraries. 
Using either method, Northern blots, preferably, are performed on a number of cell types to 
determine which cell lines express the gene of interest at the highest level. Classical 
methods of constructing cDNA libraries are taught in Sambrook et al, supra. With these 
methods, cDNA can be produced from mRNA and inserted into viral or expression vectors. 
Typically, libraries of mRNA comprising poly(A) tails can be produced with polyfT) 
primers. Similarly, cDNA libraries can be produced using the instant sequences as primers. 

PCR methods are used to amplify the members of a cDNA library that comprise the 
desired insert. In this case, the desired insert will contain sequence from the full length 
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CDNA that corresponds to the instant polynucleotides. Such PCR methods include gene 

trapping and RACE methods. Gene trapping entails inserting a member of a cDNA library 
into a vector. The vector then is denatured to produce single stranded molecules. Next, a 
substrate-bound probe, such a biotinylated oligo, is used to trap cDNA inserts of interest. 
Biotinylated probes can be linked to an avidin-bound solid substrate. PCR methods can be 
used to amplify the trapped cDNA. To trap sequences corresponding to the full length 
genes, the labeled probe sequence is based on the polynucleotide sequences of the invention. 
Random primers or primers specific to the library vector can be used to amplify the trapped 
cDNA. Such gene trapping techniques are described in Gruber et al. t WO 95/04745 and 
Gruber et al, U.S. Pat. No. 5,500,356. Kits are commercially available to perform gene 
trapping experiments from, for example, Life Technologies, Gaithersburg, Maryland, USA. 

"Rapid amplification of cDNA ends," or RACE, is a PCR method of amplifying 
cDNAs from a number of different RNAs. The cDNAs are ligated to an oligonucleotide 
linker, and amplified by PCR using two primers. One primer is based on sequence from the 
instant polynucleotides, for which full length sequence is desired, and a second primer 
comprises sequence that hybridizes to the oligonucleotide linker to amplify the cDNA. A 
description of this methods is reported in WO 97/191 10. In preferred embodiments of 
RACE, a common primer is designed to anneal to an arbitrary adaptor sequence ligated to 
cDNA ends (Apte and Siebert, Biotechniques (1993) 75:890-893; Edwards et al, Nuc. Acids 
Res. (1991) 7P.-5227-5232). When a single gene-specific RACE primer is paired with the 
common primer, preferential amplification of sequences between the single gene specific 
primer and the common primer occurs. Commercial cDNA pools modified for use in RACE 
are available. 

Another PCR-based method generates full-length cDNA library with anchored ends 
without needing specific knowledge of the cDNA sequence. The method uses lock-docking 
primers (I-VI), where one primer, poly TV (Mil) locks over the polyA tail of eukaryotic 
mRNA producing first strand synthesis and a second primer, polyGH (IV-VI) locks onto the 
polyC tail added by terminal deoxynucleotidyl transferase (TdT). This method is described 
in WO 96/40998. 

The promoter region of a gene generally is located 5' to the initiation site for RNA 
polymerase II. Hundreds of promoter regions contain the "TATA" box, a sequence such as 
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TAlTAorTATAA, which is sensitive to mutations. The promoter region can be obtained 
by performing 5' RACE using a primer from the coding region of the gene. Alternatively, 
the cDNA can be used as a probe for the genomic sequence, and the region 5' to the coding 
region is identified by "walking up." If the gene is highly expressed or differentially 
expressed, the promoter from the gene can be of use in a regulatory construct for a 
heterologous gene. 

Once the full-length cDNA or gene is obtained, DNA encoding variants can be 
prepared by site-directed mutagenesis, described in detail in Sambrook et ah, 15.3-15.63. 
The choice of codon or nucleotide to be replaced can be based on disclosure herein on 
optional changes in amino acids to achieve altered protein structure and/or function. 

As an alternative method to obtaining DNA or RNA from a biological material, 
nucleic acid comprising nucleotides having the sequence of one or more polynucleotides of 
the invention can be synthesized. Thus, the invention encompasses nucleic acid molecules 
ranging in length from 15 nucleotides (corresponding to at least 15 contiguous nucleotides of 
1 5 one of SEQ ID NOS: 1 -844) up to a maximum length suitable for one or more biological 
manipulations, including replication and expression, of the nucleic acid molecule. The 
invention includes but is not limited to (a) nucleic acid having the size of a full gene, and 
comprising at least one of SEQ ID NOS: 1-844; (b) the nucleic acid of (a) also comprising at 
least one additional gene, operably linked to permit expression of a fusion protein; (c) an 
20 expression vector comprising (a) or (b); (d) a plasmid comprising (a) or (b) ; and (e) a 

recombinant viral particle comprising (a) or (b). Once provided with the polynucleotides 
disclosed herein, construction or preparation of (a) - (e) are well within the skill in the art. 

The sequence of a nucleic acid comprising at least 15 contiguous nucleotides of at 
least any one of SEQ ID NOS: 1 -844, preferably the entire sequence of at least any one of 
25 SEQ ID NOS: 1-844, is not limited and can be any sequence of A, T, G, and/or C (for DNA) 
and A, U, G, and/or C (for RNA) or modified bases thereof, including inosine and 
pseudouridine. The choice of sequence will depend on the desired function and can be 
dictated by coding regions desired, the intron-like regions desired, and the regulatory regions 
desired. Where the entire sequence of any one of SEQ ID NOS : 1-844 is within the nucleic 
30 acid, the nucleic acid obtained is referred to herein as a polynucleotide comprising the 
sequence of any one of SEQ ID NOS: 1-844. 
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IL Expression of Polypeptide Encoded bv Full-T.^ th . DNA or Fll „. T pn „ ih n „ M 

The provided polynucleotide (e.g., a polynucleotide having a sequence of one of SEQ 
ID NOS.1-844), the corresponding cDNA, or the full-length gene is used to express a partial 
5 or complete gene product. 

Constructs of polynucleotides having sequences of SEQ ID NOS:l-844 can be 
generated synthetically. Alternatively, single-step assembly of a gene and entire plasmid 
from large numbers of oligodeoxyribonucleotides is described by, e.g. , Stemmer et al, Gene 
(Amsterdam) (1995) 7<*f7,:49-53. In this method, assembly PGR (the synthesis of long 
10 DNA sequences from large numbers of oligodeoxyribonucleotides (oligos)) is described. 
The method is derived from DNA shuffling (Stemmer, Nature (1994) 570:389-39!), and 
does not rely on DNA Iigase, but instead relies on DNA polymerase to build increasingly 
longer DNA fragments during the assembly process. For example, a 1.1 -kb fragment 
containing the TEM-1 beta-lactamase-encoding gene (bla) can be assembled in a single 
15 reaction from a total of 56 oligos, each 40 nucleotides (nt) in length. The synthetic gene can 
be PCR amplified and cloned in a vector containing the tetracycline-resistance gene (Tc-R) 
as the sole selectable marker. Without relying on ampicillin (Ap) selection, 76% of the Tc-R 
colonies were Ap-R, making this approach a general method for the rapid and cost-effective 
synthesis of any gene. 

20 Appropriate polynucleotide constructs are purified using standard recombinant DNA 

techniques as described in, for example, Sambrook et al., Molecular Cloning: A Laboratory 
Manual, 2nd Ed., (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY, and under 
current regulations described in United States Dept. of HHS, National Institute of Health 
(NIH) Guidelines for Recombinant DNA Research. The gene product encoded by a 
polynucleotide of the invention is expressed in any expression system, including, for 
example, bacterial, yeast, insect, amphibian and mammalian systems. Suitable vectors and 
host cells are described in U.S. Patent No. 5,654,173. 

Bacteria, Expression systems in bacteria include those described in Chang et al 
Nature (1978) 275:615; Goeddel Nature (1979) 257:544; Goeddel et al, Nucleic Acids 
30 Res. (1980) 5:4057; EP 0 036,776; U.S. Patent No. 4,551,433; DeBoer etal, Proc. Natl. 
Acad. Sci. (USA) (1983) 50:21-25; and Siebenlist et al, Cell (1980) 20:269. 
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Yeast. Expression systems in yeast include those described m Hinnen et al., Proc. 

Natl. Acad. Sci. (USA) (1978) 75:1929; Ito et al, J. Bacterial. (1983) 753:163; Kurtz et al, 
Mol. Cell. Biol. (1986) 6: 1 42; Kunze et al., J. Basic Microbiol. (1985) 25:141; Gleeson et 
al.,J. Gen. Microbiol. (1986) 732:3459; Roggenkamp et al, Mol. Gen. Genet. (1986) 
5 202:302; Das et al, J. Bacteriol. (1984) 755:1 165; De Louvencourt et al, J. Bacteriol 
(1983) 154:737; Van den Berg et al, Bio/Technology (1990) 5:135; Kunze et al, J. Basic 
Microbiol (1985) 25:141; Cregg et al, Mol. Cell. Biol. (1985) 5:3376; U.S. Patent Nos. 
4,837,148 and 4,929,555; Beach and Nurse, Nature (1981) 500:706; Davidow et al, Curr. 
Genet. (1985) 70:380; Gaillardin et al, Curr. Genet. (1985) 70:49; Ballance et al, Biochem. 
10 Biophys. Res. Commun. (1983) 772:284-289; Tilburn et al, Gene (1983) 25:205-221; Yelton 
et al, Proc. Natl. Acad. Sci. (USA) (1984) 57 :1470-1474; Kelly and Hynes, EMBOJ. (1985) 
4:475479; EP 0 244,234; and WO 91/00357. 

Tnsect Cells. Expression of heterologous genes in insects is accomplished as 
described in U.S. Patent No. 4,745,051; Friesen et al, "The Regulation of Baculovirus Gene 
15 Expression", in: The Molecular Biology OfBaculoviruses (1986) (W. Doerfler, ed.); EP 0 
127,839; EP 0 155,476; and Vlak et al, J. Gen. Virol. (1988) 69:765-776; Miller et al, Ann. 
Rev. Microbiol. (1988) 42:177; Carbonell et al, Gene (1988) 73:409; Maeda et al, Nature 
(1985) 3 7 5:592-594; Lebacq-Verheyden et al, Mol. Cell. Biol (1988) 5:3 129; Smith et al, 
Proc. Natl. Acad Sci. (USA) (1985) 52:8844; Miyajima et al, Gene (1987) 55:273; and 
20 Martin et al, DNA (1988) 7:99. Numerous baculoviral strains and variants and 

corresponding permissive insect host cells from hosts are described in Luckow et al, 
Bio/Technology (1988) 5:47-55, Miller et al, Generic Engineering (1986) 5:277-279, and 
Maeda et al, Nature (1 985) 375:592-594. 

Mammalian Cells. Mammalian expression is accomplished as described in Dijkema 
25 et al, EMBO J. (1985) 4:761 , Gorman et al, Proc. Natl. Acad. Sci. (USA) (1982) 79:6777, 
Bosharter al, Cell (1985) 47:521 and U.S. Patent No. 4,399,216. Other features of 
mammalian expression are facilitated as described in Ham and Wallace, Meth. Enz. (1979) 
55:44, Barnes and Sato, Anal. Biochem. (1980) 702:255, U.S. Patent Nos. 4,767,704, 
4,657,866, 4,927,762, 4,560,655, WO 90/103430, WO 87/00195, and U.S. RE 30,985. 
30 Polynucleotide molecules comprising a polynucleotide sequence provided herein 

propagated by placing the molecule in a vector. Viral and non-viral vectors are used, 
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including plasmids. The choice of plasmid will depend on the type of cell in which 
propagation is desired and the purpose of propagation. Certain vectors are useful for 
amplifying and making large amounts of the desired DNA sequence. Other vectors are 
suitable for expression in cells in culture. Still other vectors are suitable for transfer and 
expression in cells in a whole animal or person. The choice of appropriate vector is well 
within the skill of the art. Many such vectors are available commercially. The partial or 
full-length polynucleotide is inserted into a vector typically by means of DNA ligase 
attachment to a cleaved restriction enzyme site in the vector. Alternatively, the desired 
nucleotide sequence can be inserted by homologous recombination in vivo. Typically this is 
accomplished by attaching regions of homology to the vector on the flanks of the desired 
nucleotide sequence. Regions of homology are added by ligation of oligonucleotides, or by 
polymerase chain reaction using primers comprising both the region of homology and a 
portion of the desired nucleotide sequence, for example. 

The polynucleotides set forth in SEQ ID NOS:l-844 or their corresponding full- 
length polynucleotides are linked to regulatory sequences as appropriate to obtain the desired 
expression properties. These can include promoters (attached either at the 5' end of the sense 
strand or at the 3' end of the antisense strand), enhancers, terminators, operators, repressors, 
and inducers. The promoters can be regulated or constitutive. In some situations it may be 
desirable to use conditionally active promoters, such as tissue-specific or developmental 
stage-specific promoters. These are linked to the desired nucleotide sequence using the 
techniques described above for linkage to vectors. Any techniques known in the art can be 
* used. 

When any of the above host cells, or other appropriate host cells or organisms, are 
used to replicate and/or express the polynucleotides or nucleic acids of the invention, the 
resulting replicated nucleic acid, RNA, expressed protein or polypeptide, is within the scope 
of the invention as a product of the host cell or organism. The product is recovered by any 
appropriate means known in the art. 

Once the gene corresponding to a selected polynucleotide is identified, its expression 
can be regulated in the cell to which the gene is native. For example, an endogenous gene of 
a cell can be regulated by an exogenous regulatory sequence as disclosed in U.S. Patent No. 
5,641,670. 
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HI. Identification of Functional ar H Structural Motifs of Novel Genes 

A . Sheening Polynucleotide Sequences and A mino Acid Sequences Against 
Publicly Available Databases 
5 Translations of the nucleotide sequence of the provided polynucleotides, cDNAs or 

full genes can be aligned with individual known sequences. Similarity with individual 
sequences can be used to determine the activity of the polypeptides encoded by the 
polynucleotides of the invention. For example, sequences that show similarity with a 
chemokine sequence can exhibit chemokine activities. Also, sequences exhibiting similarity 
10 with more than one individual sequence can exhibit activities that are characteristic of either 
or both individual sequences. 

The full length sequences and fragments of the polynucleotide sequences of the 
nearest neighbors can be used as probes and primers to identify and isolate the full length 
sequence corresponding to provided polynucleotides. The nearest neighbors can indicate a 
1 5 tissue or cell type to be used to construct a library for the full-length sequences 
corresponding to the provided polynucleotides.. 

Typically, a selected polynucleotide is translated in all six frames to determine the 
best alignment with the individual sequences. The sequences disclosed herein in the 
Sequence Listing are in a 5' to 3' orientation and translation in three frames can be sufficient 
20 (with a few specific exceptions as described in the Examples). These amino acid sequences 
are referred to, generally, as query sequences, which will be aligned with the individual 
sequences. Databases with individual sequences are described in "Computer Methods for 
Macromolecular Sequence Analysis" Methods in Enzymology (1996) 266, Doolittle, 
Academic Press, Inc., a division of Harcourt Brace & Co., San Diego, California, USA. 
25 Databases include Genbank, EMBL, and DN A Database of Japan (DDBJ). 

Query and individual sequences can be aligned using the methods and computer 
programs described above, and include BLAST, available over the world wide web at 
htt p://ww.ncbi.nlTn nih.gov/BLAST/ . Another alignment algorithm is Fasta, available in the 
Genetics Computing Group (GCG) package, Madison, Wisconsin, USA, a wholly owned 
30 subsidiary of Oxford Molecular Group, Inc. Other techniques for alignment are described in 
Doolittle, supra. Preferably, an alignment program that permits gaps in the sequence is 
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utilized to align the sequences. The Smith-Waterman is one type of algorithm that permits 

gaps in sequence alignments. See Meth. Mol Biol. (1997) 70: 173-187. Also, the GAP 
program using the Needleman and Wunsch alignment method can be utilized to align 
sequences. An alternative search strategy uses MPSRCH software, which runs on a 
MASPAR computer. MPSRCH uses a Smith-Waterman algorithm to score sequences on a 
massively parallel computer. This approach improves ability to identify sequences that are 
distantly related matches, and is especially tolerant of small gaps and nucleotide sequence 
errors. Amino acid sequences encoded by the provided polynucleotides can be used to 
search both protein and DNA databases. 

Results of individual and query sequence alignments can be divided into three 
categories, high similarity, weak similarity, and no similarity. Individual alignment results 
ranging from high similarity to weak similarity provide a basis for determining polypeptide 
activity and/or structure. Parameters for categorizing individual results include: percentage 
of the alignment region length where the strongest alignment is found, percent sequence 
identity, and p value. 

The percentage of the alignment region length is calculated by counting the number 
of residues of the individual sequence found in the region of strongest alignment, e.g., 
contiguous region of the individual sequence that contains the greatest number of residues 
that are identical to the residues of the corresponding region of the aligned query sequence. 
This number is divided by the total residue length of the query sequence to calculate a 
percentage. For example, a query sequence of 20 amino acid residues might be aligned with 
a 20 amino acid region of an individual sequence. The individual sequence might be 
identical to amino acid residues 5, 9-15, and 17-19 of the query sequence. Theregionof 
strongest alignment is thus the region stretching from residue 9-19, an 1 1 amino acid stretch. 
The percentage of the alignment region length is: 1 1 (length of the region of strongest 
alignment) divided by (query sequence length) 20 or 55%. 

Percent sequence identity is calculated by counting the number of amino acid 
matches between the query and individual sequence and dividing total number of matches by 
the number of residues of the individual sequences found in the region of strongest 
alignment. Thus, the percent identity in the example above would be 10 matches divided by 
1 1 amino acids, or approximately, 90.9% 
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P value is the probability that the alignment was produced by chance. For a single 
alignment, the p value can be calculated according to Karlin et al, Proc. Natl. Acad. Sci. 
(1990) 57:2264 and Karlin et al., Proc. Natl. Acad. Sci. (1993) 90. The p value of multiple 
alignments using the same query sequence can be calculated using an heuristic approach 
5 described in Altschul et al.,Nat. Genet. (1994) 6:1 19. Alignment programs such as BLAST 
program can calculate the p value. 

Another factor to consider for determining identity or similarity is the location of the 
similarity or identity. Strong local alignment can indicate similarity even if the length of 
alignment is short. Sequence identity scattered throughout the length of the query sequence 
1 0 also can indicate a similarity between the query and profile sequences. The boundaries of 
the region where the sequences align can be determined according to Doolittle, supra; 
BLAST or FAST programs; or by determining the area where sequence identity is highest. 

High Similarity. In general, in alignment results considered to be of high similarity, 
the percent of the alignment region length is typically at least about 55% of total length 
1 5 query sequence; more typically, at least about 58%; even more typically; at least about 60% 
of the total residue length of the query sequence. Usually, percent length of the alignment 
region can be as much as about 62%; more usually, as much as about 64%; even more 
usually, as much as about 66%. Further, for high similarity, the region of alignment, 
typically, exhibits at least about 75% of sequence identity; more typically, at least about 
20 78%; even more typically; at least about 80% sequence identity. Usually, percent sequence 
identity can be as much as about 82%; more usually, as much as about 84%; even more 

usually, as much as about 86%. 

The p value is used in conjunction with these methods. If high similarity is found, 
the query sequence is considered to have high similarity with a profile sequence when the p 

25 value is less than or equal to about 10" 2 ; more usually; less than or equal to about 10 3 ; even 
more usually; less than or equal to about 10" 4 . More typically, the p value is no more than 
about lO' 5 ; more typically; no more than or equal to about 10-'°; even more typically; no 
more than or equal to about 10 ,s for the query sequence to be considered high similarity. 
Weak Similarity. In general, where alignment results considered to be of weak 

30 similarity, there is no minimum percent length of the alignment region nor minimum length 
of alignment. A better showing of weak similarity is considered when the region of 
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alignment is, typically, at least about 15 amino acid residues in length; more typically, at 

least about 20; even more typically; at least about 25 amino acid residues in length. Usually, 
length of the alignment region can be as much as about 30 amino acid residues; more 
usually, as much as about 40; even more usually, as much as about 60 amino acid residues. 
Further, for weak similarity, the region of alignment, typically, exhibits at least about 35% of 
sequence identity; more typically, at least about 40%; even more typically; at least about 
45% sequence identity. Usually, percent sequence identity can be as much as about 50%; 
more usually, as much as about 55%; even more usually, as much as about 60%. 

If low similarity is found, the query sequence is considered to have weak similarity 
with a profile sequence when the p value is usually less than or equal to about 1 0" 2 ; more 
usually; less than or equal to about 10 3 ; even more usually; less than or equal to about 10" 4 . 
More typically, the p value is no more than about 10" 5 ; more usually; no more than or equal 
to about 10- ,0 ; even more usually; no more than or equal to about 10 15 for the query sequence 
to be considered weak similarity. 

Similarity Determined by Sequence Identify Alnn. Sequence identity alone can be 
used to determine similarity of a query sequence to an individual sequence and can indicate 
the activity of the sequence. Such an alignment, preferably, permits gaps to align sequences. 
Typically, the query sequence is related to the profile sequence if the sequence identity over 
the entire query sequence is at least about 15%; more typically, at least about 20%; even 
more typically, at least about 25%; even more typically, at least about 50%. Sequence 
identity alone as a measure of similarity is most useful when the query sequence is usually, 
at least 80 residues in length; more usually, 90 residues; even more usually, at least 95 amino 
acid residues in length. More typically, similarity can be concluded based on sequence 
identity alone when the query sequence is preferably 100 residues in length; more preferably, 
120 residues in length; even more preferably, 150 amino acid residues in length. 

Determining Activity from Alignment, wit h Profile and Multiple Aligned S^ ,™^ 
Translations of the provided polynucleotides can be aligned with amino acid profiles that 
define either protein families or common motifs. Also, translations of the provided 
polynucleotides can be aligned to multiple sequence alignments (MSA) comprising the 
polypeptide sequences of members of protein families or motifs. Similarity or identity with 
profile sequences or MSAs can be used to determine the activity of the gene products (e.g., 
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polypeptides) encoded by the provided polynucleotides or corresponding cDNA or genes. 
For example, sequences that show an identity or similarity with a chemokine profile or MSA 
can exhibit chemokine activities. 

Profiles can designed manually by (1) creating an MSA, which is an alignment of the 
5 amino acid sequence of members that belong to the family and (2) constructing a statistical 
representation of the alignment. Such methods are described, for example, in Bimey et al, 
Nucl. Acid Res. (1996) 24(14): 2730-2739. MSAs of some protein families and motifs are 
publicly available. For example, htt T v//oenome.wustl.edu/Pfam/ includes MSAs of 547 
different families and motifs. These MSAs are described also in Sonnhammer et al, 
1 0 Proteins ( 1 997) 28: 405-420. Other sources over the world wide web include the site at 
T,tt p-//www.emb1-h ft ide1her g .de/a rPn S / a 1i/ali.html: alternatively, a message can be sent to 
AT .l^HMBL-HFTDFJ .BERG.DE for the information. A brief description of these MSAs is 
reported in Pascarella et al, Prot. Eng. (1996) 9^:249-251 . Techniques for building 
profiles from MSAs are described in Sonnhammer et al., supra; Birney et al., supra; and 
15 "Computer Methods for Macromolecular Sequence Analysis," Methods in Enzymology 

(1996) 266, Doolittle, Academic Press, Inc., a division of Harcourt Brace & Co., San Diego, 
California, USA. 

Similarity between a query sequence and a protein family or motif can be determined 
by (a) comparing the query sequence against the profile and/or (b) aligning the query 
20 sequence with the members of the family or motif. Typically, a program such as Searchwise 
is used to compare the query sequence to the statistical representation of the multiple 
alignment, also known as a profile. The program is described in Birney et al., supra. Other 
techniques to compare the sequence and profile are described in Sonnhammer et al, supra 
and Doolittle, supra. 

25 Next, methods described by Feng et al, J. Mol. Evol. (1987) 25:351 and Higgins et 

al, CABIOS (1989) 5:151 can be used align the query sequence with the members of a 
family or motif, also known as a MSA. Computer programs, such as PILEUP, can be used. 
See Feng et al, infra. In general, the following factors are used to determine if a similarity 
between a query sequence and a profile or MSA exists: (1) number of conserved residues 

30 found in the query sequence, (2) percentage of conserved residues found in the query 
sequence, (3) number of frameshifts, and (4) spacing between conserved residues. 
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borne alignment programs that both translate and align sequences can make any 

number of frameshifts when translating the nucleotide sequence to produce the best 
alignment. The fewer frameshifts needed to produce an alignment, the stronger the 
similarity or identity between the query and profile or MSAs. For example, a weak 
similarity resulting from no frameshifts can be a better indication of activity or structure of a 
query sequence, than a strong similarity resulting from two frameshifts. Preferably, three or 
fewer frameshifts are found in an alignment; more preferably two or fewer frameshifts; even 
more preferably, one or fewer frameshifts; even more preferably, no frameshifts are found in 
an alignment of query and profile or MSAs. 

Conserved residues are those amino acids found at a particular position in all or some 
of the family or motif members. For example, most chemokines contain four conserved 
cysteines. Alternatively, a position is considered conserved if only a certain class of amino 
acids is found in a particular position in all or some of the family members. For example, 
the N-terminal position can contain a positively charged amino acid, such as lysine, arginine, 
15 or histidine. 

Typically, a residue of a polypeptide is conserved when a class of amino acids or a 
single amino acid is found at a particular position in at least about 40% of all class members; 
more typically, at least about 50%; even more typically, at least about 60% of the members. ' 
Usually, a residue is conserved when a class or single amino acid is found in at least about 
70% of the members of a family or motif; more usually, at least about 80%; even more 
usually, at least about 90%; even more usually, at least about 95%. 

A residue is considered conserved when three unrelated amino acids are found at a 
particular position in the some or all of the members; more usually, two unrelated amino 
acids. These residues are conserved when the unrelated amino acids are found at particular 
positions in at least about 40% of all class member; more typically, at least about 50%; even 
more typically, at least about 60% of the members. Usually, a residue is conserved when a 
class or single amino acid is found in at least about 70% of the members of a family or motif; 
more usually, at least about 80%; even more usually, at least about 90%; even more usually, ' 
at least about 95%. 

A query sequence has similarity to a profile or MSA when the query sequence 
comprises at least about 25% of the conserved residues of the profile or MSA; more usually, 
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at least about 30%; even more usually; at least about 40%. Typically, the query sequence 

has a stronger similarity to a profile sequence or MSA when the query sequence comprises at 
least about 45% of the conserved residues of the profile or MSA; more typically, at least 
about 50%; even more typically; at least about 55%. 
5 b. Screening Polynucleotide and Ami no Acid Sequences Against Protein 

Profiles 

The identify and function of the gene that correlates to a polynucleotide described 
herein can be determined by screening the polynucleotides or their corresponding amino acid 
sequences against profiles of protein families. Such profiles focus on common structural 

10 motifs among proteins of each family. Publicly available profiles are described above in 
Section IV A. Additional or alternative profiles are described below. 

In comparing a novel polynucleotide with known sequences, several alignment tools 
are available. Examples include PileUp, which creates a multiple sequence alignment, and is 
described in Feng et al, J. Mol Evol. (1987) 25:351. Another method, GAP, uses the 

15 alignment method of Needleman et al, J. Mol Biol. (1970) 48:443. GAP is best suited for 
global alignment of sequences. A third method, BestFit, functions by inserting gaps to 
maximize the number of matches using the local homology algorithm of Smith et al , Adv. 
Appl. Math. (1981) 2:482. Exemplary protein profiles are provided below and in the 
examples. 

20 Chemokines. Chemokines are a family of proteins that have been implicated in 

lymphocyte trafficking, inflammatory diseases, angiogenesis, hematopoiesis, and viral 
infection. See, for example, Rollins, Blood (1997) 90^:909-928, and Wells et al., J. Leuk. 
Biol. (1997) 67:545-550. U.S. Patent No. 5,605,817 discloses DNA encoding a chemokine 
expressed in fetal spleen. U.S. Patent No. 5,656,724 discloses chemokine-like proteins and 

25 methods of use. U.S. Patent No. 5,602,008 discloses DNA encoding a chemokine expressed 
by liver. 

Chemokine mutants are polypeptides having an amino acid sequence that possesses 
at least one amino acid substitution, addition, or deletion as compared to native chemokines. 
Fragments possess the same amino acid sequence of the native chemokines; mutants can 
30 lack the amino and/or carboxyl terminal sequences. Fusions are mutants, fragments, or 

native chemokines that also include amino and/or carboxyl terminal amino acid extensions. 
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I he number or type of the amino acid changes is not critical, nor is the length or 
number of the amino acid deletions, or amino acid extensions that are incorporated in the 
chemokines as compared to the native chemokine amino acid sequences. A polynucleotide 
encoding one of these variant polypeptides will retain at least about 80% amino acid identity 
5 with at least one known chemokine. Preferably, these polypeptides will retain at least about 
85% amino acid sequence identity, more preferably, at least about 90%; even more 
preferably, at least about 95%. In addition, the variants exhibit at least 80%; preferably 
about 90%; more preferably about 95% of at least one activity exhibited by a native 
chemokine, which includes immunological, biological, receptor binding, and signal 
10 transduction functions. 

Assays for chemotaxis relating to neutrophils are described in Walz et al, Biochem. 
Biophys. Res. Commun. (1987) 149:755, Yoshimura et al, Proc. Natl. Acad. Set. (USA) 

(1987) 5*9233, and Schroder et al, J. Immunol (1987) 75P:3474; to lymphocytes, Larsen et 
al, Science (1989) 245:1464, Carr et al, Proc. Natl Acad. Set (USA) (1994) P7:3652; to 
tumor-infiltrating lymphocytes, Liao et al, J. Exp. Med '(1995). 752:1301; to hematopoietic 
progenitors, Aiuti et al, J. Exp. Med. (1997) 755:11 1; to monocytes, Valente et al, Biochem. 

(1988) 27:4162; and to natural killer cells, Loetscher et al, J. Immunol. (1996) 755:322, and 
Allavena et al, Eur. J. Immunol (1994) 24:3233. 

Assays for determining the biological activity of attracting eosinophils are described 
in Dahinden J. Exp. Med. (1994) 77P:751, Weber etal,J. Immunol. (1995) 754:4166, 
and Noso et al, Biochem. Biophys. Res. Commun. (1994) 200:1470; for attracting dendritic 
cells, Sozzani et al, J. Immunol. (1995) 755:3292; for attracting basophils, in Dahinden et 
al, J. Exp. Med. (1994) 77P:751, Alam et al, J. Immunol. (1994) 752:1298, Alam et al, J. 
Exp. Med. (1 992) 7 76:78 1 ; and for activating neutrophils, Maghazaci et al, Eur. J. Immunol 
(1996)25:315, and Taub a/., J Immunol. (1995)755:3877. Native chemokines can act as 
mitogens for fibroblasts, assayed as described in Mullenbach et al, J. Biol. Chem. (1986) 
257:719. 

Native chemokines exhibit binding activity with a number of receptors. Description 
of such receptors and assays to detect binding are described in, for example, Murphy et al, 
Science (1991) 255:1280; Combadiere et al, J. Biol. Chem. (1995) 270:29671; Daugherty er 
al, J. Exp. Med. (1 996) 755:2349; Samson et al, Biochem. (1996) 55:3362; Raport et al, J. 
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BiTcZm™996) 277:17161; Combadiere et al, J. Leukoc. Biol. (1996) (50:147; Baba et 
al, J. Biol. Chem. (1997) 25:14893; Yosida et al., J. Biol. Chem. (1997) 272:13803; 
Arvannitakis et al, Nature (1997) 385:347, and other assays are known in the art. 

Assays for kinase activation of chemokines are described by Yen et al, J. Leukoc. 
5 Biol. (1997) 67:529; Dubois et al, J. Immunol. (1996) 756:1356; Turner et al, J. Immunol. 
(1995) 755:2437. Assays for inhibition of angiogenesis or cell proliferation are described in 
Maione et al, Science (1990) 247:77. Glycosaminoglycan production can be induced by 
native chemokines, assayed as described in Castor et al, Proc. Natl. Acad. Sci. (USA) (1983) 
«0:765. Chemokine-mediated histamine release from basophils is assayed as described in 
10 Dahinden et al, J. Exp. Med. (1989) 770:1787; and White et al, Immunol. Lett. (1989) 
22:151. Heparin binding is described in Luster et al, J. Exp. Med. (1995) 752:219. 

Chemokines can possess dimerization activity, which can be assayed according to 
Burrows et al, Biochem. (1994) 33:12741; and Zhang et al, Mol. Cell Biol. (1995) 75:4851. 
Native chemokines can play a role in the inflammatory response of viruses. This activity 
15 can be assayed as described in Bleul et al, Nature (1996) 352:829; and Oberlin et al, Nature 
(1996) 352:833. Exocytosis of monocytes can be promoted by native chemokines. The 
assay for such activity is described in Uguccioni et al, Eur. J. Immunol. (1995) 25:64. 
Native chemokines also can inhibit hematopoietic stem cell proliferation. The method for 
testing for such activity is reported in Graham et al, Nature (1990) 344:442. 
20 rvsth remain Proteins. Several protein families contain death domain motifs 

(Feinstein and Kimchi, TIBS Letters (1995) 20:242). Some death domain containing 
proteins are implicated in cytotoxic intracellular signaling (Cleveland et al, Cell (1995) 
57:479, Pan et al, Science (1997) 276:1 1 1; Duan et al, Nature (1997) 355:86-89, and 
Chinnaiyan et al, Science (1996) 274:990). U.S. Patent No. 5,563,039 describes a protein 
25 homologous to TRADD (Tumor Necrosis Factor Receptor-1 Associated Death Domain 
containing protein), and modifications of the active domain of TRADD that retain the 
functional characteristics of the protein, as well as apoptosis assays for testing the function of 
such death domain containing proteins. U.S. Patent No. 5,658,883 discloses biologically 
active TGF-B1 peptides. U.S. Patent No. 5,674,734 discloses RIP, which contains a C- 
30 terminal death domain and an N-terminal kinase domain. 
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Leukemia Inhibitory Factor (UF) An LIF profile is constructed from sequences of 

leukemia inhibitor factor, CT-1 (cardiotropic- 1), CNTF (ciliary neurotrophic factor), OSM 
(oncostatin M), and IL-6 (interleukin-6). This profile encompasses a family of secreted 
cytokines that have pleiotropic effects on many cell types including hepatocytes, osteoclasts, 
neuronal cells and cardiac myocytes, and can be used to detect additional genes encoding 
such proteins. These molecules are all structurally related and share a common co-receptor 
g P 130 which mediates intracellular signal transduction by cytoplasmic tyrosine kinases such 
as src. 

Novel proteins related to this family are also likely to be secreted, to activate gpl30 
and to function in the development of a variety of cell types. Thus new members of this 
family would be candidates to be developed as growth or survival fact™ f«r tu, ~n _ 

~ ~~ «*w WW* A KJ pWO 

that they stimulate. For more details on this family of cytokines, see Pennica et al, Cytokine 
and Growth Factor Reviews (1996) 7:81-91. U.S. Patent No. 5,420,247 discloses LIF 
receptor and fusion proteins. U.S. Patent No. 5,443,825 discloses human LIF. 

Angiopoietin. Angiopoietin-1 is a secreted ligand of the TIE-2 tyrosine kinase; it 
functions as an angiogenic factor critical for normal vascular development. Angiopoietin-2 is 
a natural antagonist of angiopoietin-1 and thus functions as an anti-angiogenic factor. These 
two proteins are structurally similar and activate the same receptor (Folkman et al., Cell 
(1996) 87:1 153, and Davis et al., Cell (1996) 57:1 161). The angiopoietin molecules are 
composed of two domains: a coiled-coil region and a region related to fibrinogen. The 
fibrinogen domain is found in many molecules including ficolin and tesascin, and is well 
defined structurally with many members. 

Receptor Protein-Tyrosine Kinases Receptor Protein-Tyrosine Kinases or RPTKs 
are described in Lindberg, Annu. Rev. Cell Biol. (1994) 70:251-337. 

Growth Factors: (Epidermal Growth F a .tnr) F,QF and (Fihrnh ^t Growth F»rt«r) 
FOR For a discussion of growth factor superfamilies, see Growth Factors: A Practical 
Approach, (Appendix Al) (1993) McKay and Leigh, Oxford University Press, NY, 237-243. 
U.S. Patent No. 4,444,760 discloses acidic brain fibroblast growth factor, which isactive in 
the promotion of cell division and wound healing. U.S. Patent No. 5,439,818 discloses DNA 
encoding human recombinant basic fibroblast growth factor, which is active in wound 
healing. U.S. Patent No. 5,604,293 discloses recombinant human basic fibroblast growth 
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factor, which is useful for wound healing. U.S. Patent No. 5,410,832 discloses bram-denved 

and recombinant acidic fibroblast growth factor, which act as mitogens for mesoderm and 

neuroectoderm-derived cells in culture, and promote wound healing in soft tissue, 

cartilaginous tissue and musculoskeletal tissue. U.S. Patent No. 5,387,673 discloses 

5 biologically active fragments of FGF. 

Pmrin. oftbeTNF Family. A profile derived from the TNF family is created by 
aligning sequences of the following TNF family members: nerve growth factor (NGF), 
lymphotoxin, Fas ligand, tumor necrosis factor (TNFa), CD40 ligand, TRAIL, ox40 ligand, 
4-1BB ligand, CD27 ligand, and CD30 ligand. The profile is designed to identify sequences 

10 of proteins that constitute new members or homologues of this family of proteins. U.S. 
Patent No. 5,606,023 discloses mutant TNF proteins; U.S. Patent No. 5,597,899 and U.S. 
Patent No. 5,486,463 disclose TNF muteins; and U.S. Patent No. 5,652,353 discloses DNA 

encoding TNFa muteins. 

Members of the TNF family of proteins have been show in vitro to multimerize, as 
15 described in Burrows et al, Biochem. (1994) 55:12741 and Zhang et al, Mol Cell. Biol. 
(1 995) 7 5:485 1 and bind receptors as described in Browning et al. , J. Immunol. (1 994) 
747:1230, Androlewicz et al, J. Biol. CW(1992) 257:2542, and Crowe et al, Science 
(1994)264:707. 

In vivo, TNFs proteolytically cleave a target protein as described in Kriegel et al, 
20 Cell (1988) 55:45 and Mohler etal, Nature (1994) 570:218 and demonstrate cell 

proliferation and differentiation activity. T-cell or thymocyte proliferation is assayed as 
described in Armitage et al, Eur. J. Immunol. (1992) 22:447; Current Protocols in 
Immunology, ed. J.E. Coligan et al, 3.1-3.19; Takai et al.,J. Immunol (1986) 757:3494- 
3500, Bertagnoli et al, J. Immunol (1990) 745:1706, Bertagnoli et al, J. Immunol (1991) 
25 755:327, Bertagnoli et al, J. Immunol. (1992) 749:3778, and Bowman et al, J. Immunol 
(1994) 7 52:1756. B cell proliferation and Ig secretion are assayed as described in 
Maliszewski, J. Immunol (1990) 744:3028, and Assays for B Cell Function: In Vitro 
Antibody Production. Mond and Brunswick, Current Protocols in Immunol., Coligan Ed vol 
1 pp 3.8.1-3.8.16, John Wiley and Sons, Toronto 1994, Kehrl et al, Science (1987) 255:1 144 
30 and Boussiotis et al.,PNAS USA (1994) 97:7007. Other in vivo activities include 

upregulation of cell surface antigens, upregulation of costimulatory molecules, and cellular 
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aggregation/adhesion as described in Barrett et a!., J. Immunol. (1991) 74*1722; Bjorck et 
al, Eur. J. Immunol (1993) 25:1771; Clark et al.,Annu Rev. Immunol. (1991) 9:97; 
Ranheim et al, J. Exp. Med. (1994) 777:925; Yellin, J. Immunol (1994) 755:666; and Gruss 
et al, Blood (1994) 84:2305. 

Proliferation and differentiation of hematopoietic and lymphopoietic cells has also 
been shown in vivo for TNFs, using assays for embryonic differentiation and hemopoiesis 
as described in Johansson et al, Cellular Biology (1995) 75:141, Keller et al, Mol. Cell. 
Biol. (1993) 75:473, McClanahan et al, Blood (1993) 57:2903 and using assays to detect 
stem cell survival and differentiation as described in Culture of Hematopoietic Cells, 
Freshney et al eds, pp 1-21, 23-29, 139-162, 163-179, and 265-268, Wiley-Liss, Inc., New 
York, NY, 1994, and Hirajama et al, PNAS USA (1992) <SP:5907. 

In vivo activities of TNFs also include lymphocyte survival and apoptosis, assayed as 
described in Darcynkewicz et al, Cytometry (1992) 75:795; Gorczca et al, Leukemia (1993) 
7:659; Itoh et al, Cell (1991) 6*233; Zacharduk, J. Immunol (1990) 745:4037; Zamai et 
al, Cytometry (1993) 74:891; and Gorczyca etal, Int'U. Oncol. (1992) 7:639. Some 
members of the TNF family are cleaved from the cell surface; others remain membrane 
bound. The three-dimensional structure of TNF is discussed in Sprang and Eck, Tumor 
Necrosis Factors; supra. 

TNF proteins include a transmembrane domain. The protein is cleaved into a shorter 
soluble version, as described in Kriegler et al, Cell (1988) 55:45, Perez et al, Cell (1990) 
65:251, and Shaw et al, Cell (1986) 4*659. The transmembrane domain is between amino 
acid 46 and 77 and the cytoplasmic domain is between position 1 and 45 on the human form 
ofTNFa. The 3-dimensional motifs of TNF include a sandwich of two pleated p sheets. 
Each sheet is composed of anti-parallel p strands, p strands facing each other on opposite 
sites of the sandwich are connected by short polypeptide loops, as described in Van Ostade et 
al, Protein Engineering (1994) 7(1):5, and Sprang et al, Tumor Necrosis Factors; supra 
Residues of the TNF family proteins that are involved in the p sheet secondary structure 
have been identified as described in Van Ostade et al, Protein Eng. (1994) 7^:5, and 
Sprang et al, supra. 

TNF receptors are disclosed in U.S. Patent No. 5,395,760. A profile derived from the 
TNF receptor family is created by aligning sequences of the TNF receptor family, including 
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ApoTas^FR I and II, death receptor 3 (DR3), CD40, ox40, CD27, and CD30. Thus, the 
profile is designed to identify from the polynucleotides of the invention sequences of 
proteins that constitute new members or homologues of this family of proteins. 

Tumor necrosis factor receptors exist in two forms in humans: p55 TNFR and p75 

5 TNFR, both of which provide intracellular signals upon binding with a ligand. The 

extracellular domains of these receptor proteins are cysteine rich. The receptors can remain 
membrane bound, although some forms of the receptors are cleaved forming soluble 
receptors. The regulation, diagnostic, prognostic, and therapeutic value of soluble TNF 
receptors is discussed in Aderka, Cytokine and Growth Factor Reviews, (1996) 7^:231. 

10 POOF Family. U.S. Patent No. 5,326,695 discloses platelet derived growth factor 

agonists; bioactive portions of PDGF-B are used as agonists. U.S. Patent No. 4,845,075 
discloses biologically active B-chain homodimers, and also includes variants and derivatives 
of the PDGF-B chain. U.S. Patent No. 5,128,321 discloses PDGF analogs and methods of 
use. Proteins having the same bioactivity as PDGF are disclosed, including A and B chain 

15 proteins. 

if in a *P rTncludin P MKTO Family. U.S. Patent No. 5,650,501 discloses 
serine/threonine kinase, associated with mitotic and meiotic cell division; the protein has a 
kinase domain in its N-terminal and 3 PEST regions in the C-terminus. U.S. Patent No. 
5,605,825 discloses human PAK65, a serine protein kinase. 
20 ' The foregoing discussion provides a few examples of the protein profiles that can be 
compared with the polynucleotides of the invention. One skilled in the art can use these and 
other protein profiles to identify the genes that correlate with the provided polynucleotides. 
C. Identification nf Secreted & Membrane-Bound Polypeptides 
Both secreted and membrane-bound polypeptides of the present invention are of 
25 particular interest. For example, levels of secreted polypeptides can be assayed in body 

fluids that are convenient, such as blood, urine, prostatic fluid and semen. Membrane-bound 
polypeptides are useful for constructing vaccine antigens or inducing an immune response. 
Such antigens would comprise all or part of the extracellular region of the membrane-bound 
polypeptides. Because both secreted and membrane-bound polypeptides comprise a 
30 fragment of contiguous hydrophobic amino acids, hydrophobic^ predicting algorithms can 
be used to identify such polypeptides. 
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A signal sequence » usually encoded by both secreted and membrane-bound 
polypeptide genes to direct a polypeptide to the surface of the cell. The signal sequence 
usually comprises a stretch of hydrophobic residues. Such signal sequences can fold into 
helical structures. Membrane-bound polypeptides typically comprise at least one 
5 transmembrane region that possesses a stretch of hydrophobic amino acids that can 
transverse the membrane. Some transmembrane regions also exhibit a helical structure. 
Hydrophobic fragments within a polypeptide can be identified by using computer 
algorithms. Such algorithms include Hopp & Woods, Proc. Natl. Acad. Sci. USA (1981) 
75:3824-3828; Kyte & Doolittle, y. Mol. Biol. (1982) 157: 105-132; and RAOAR algorithm, 
1 0 Degli Esposti et al, Eur. J. Biochem. (1 990) 1 90: 207-2 1 9. 

Another method of identifying secreted and membrane-bound polypeptides is to 
translate the polynucleotides of the invention in all six frames and determine if at least 8 
contiguous hydrophobic amino acids are present. Those translated polypeptides with at least 
8; more typically, 10; even more typically, 12 contiguous hydrophobic amino acids are 
considered to be either a putative secreted or membrane bound polypeptide. Hydrophobic 
amino acids include alanine, glycine, histidine, isoleucine, leucine, lysine, methionine, 
phenylalanine, proline, threonine, tryptophan, tyrosine, and valine. 

IV - Verification of the Function of an Pxnressinn Prnd.w o f a Fnll-T^ th rw, 
20 Correspondi ng to a Polynucleotide 

Ribozymes, antisense constructs, and dominant negative mutants can be used to 
determine function of the expression product of a gene corresponding to a polynucleotide 
provided herein. These methods and compositions are particularly useful where the provided 
novel polynucleotide exhibits no significant or substantial homology to a sequence encoding 
a gene of known function. Antisense molecules and ribozymes can be constructed from 
synthetic polynucleotides. Typically, the phosphoramidite method of oligonucleotide 
synthesis is used. See Beaucage etal., Tet. Lett. (1981) 22:1859 and U.S. Patent No. 
4,668,777. Automated devices for synthesis are available to create oligonucleotides using 
this chemistry. Examples of such devices include Biosearch 8600, Models 392 and 394 by 
30 Applied Biosystems, a division of Perkin-Elmer Corp., Foster City, California, USA; and 
Expedite by Perceptive Biosystems, Framingham, Massachusetts, USA. Synthetic RNA, 
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phosphate analog oligonucleotides, and chemically derivatized oligonucleotides can also be 
produced, and can be covalently attached to other molecules. RNA oligonucleotides can be 
synthesized, for example, using RNA phosphoramidites. This method can be performed on 
an automated synthesizer, such as Applied Biosystems, Models 392 and 394, Foster City, 
California, USA. See Applied Biosystems User Bulletin 53 and Ogilvie et ah, Pure & 

Applied Chem. (1987) 59:325. 

Phosphorothioate oligonucleotides can also be synthesized for antisense construction. 
A sulfurizing reagent, such as tetraethylthiruam disulfide (TETD) in acetonitrile can be used 
to convert the internucleotide cyanoethyl phosphite to the phosphorothioate triester within 15 
minutes at room temperature. TETD replaces the iodine reagent, while all other reagents 
used for standard phosphoramidite chemistry remain the same. Such a synthesis method can 
be automated using Models 392 and 394 by Applied Biosystems, for example. 

Oligonucleotides of up to 200 nucleotides can be synthesized, more typically, 100 
nucleotides, more typically 50 nucleotides; even more typically 30 to 40 nucleotides. These 
synthetic fragments can be annealed and ligated together to construct larger fragments. See, 
for example, Sambrook et ah, supra. 

A. Ribozvmes 

Trans-cleaving catalytic RNAs (ribozymes) are RNA molecules possessing 
endoribonuclease activity. Ribozymes are specifically designed for a particular target, and 
the target message must contain a specific nucleotide sequence. They are engineered to 
cleave any RNA species site-specifically in the background of cellular RNA. The cleavage 
event renders the mRNA unstable and prevents protein expression. Importantly, ribozymes 
can be used to inhibit expression of a gene of unknown function for the purpose of 
determining its function in an in vitro or in vivo context, by detecting the phenotypic effect. 

One commonly used ribozyme motif is the hammerhead, for which the substrate 
sequence requirements are minimal. Design of the hammerhead ribozyme is disclosed in 
Usman et al, Current Opin. Struct. Biol. (1996) 6:527. Usman also discusses the 
therapeutic uses of ribozymes. Ribozymes can also be prepared and used as described in 
Long etah, FASEBJ. (1993) 7:25; Symons, Ann. Rev. Biochem. (1992) 67:641; Perrottae/ 
ah, Biochem. (1992) 37:16; Ojwang et al, Proc. Natl. Acad. Sci. (USA) (1992) 59:10802; 
and U.S. Patent No. 5,254,678. Ribozyme cleavage of HIV-I RNA is described in U.S. 
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Patent No. 5,144,019; methods of cleaving RNA using ribozymes is described in U.S. 
Patent No. 5,1 16,742; and methods for increasing the specificity of ribozymes are described 
in U.S. Patent No. 5,225,337 and Koizumi et al., Nucleic Acid Res. (1989) 77:7059. 
Preparation and use of ribozyme fragments in a hammerhead structure are also described by 
Koizumi etal, Nucleic Acids Res. (1989)77:7059. Preparation and use of ribozyme 
fragments in a hairpin structure are described by Chowrira and Burke, Nucleic Acids Res. 
(1992) 20:2835. Ribozymes can also be made by rolling transcription as described in 
Daubendiek and Kool, Nat. Biotechnol. (1997) 15(3):213. 

The hybridizing region of the ribozyme can be modified or can be prepared as a 
branched structure as described in Horn and Urdea, Nucleic Acids Res. (1989) 77:6959. The 
basic structure of the ribozymes can also be chemically altered in ways familiar to those 
skilled in the art, and chemically synthesized ribozymes can be administered as synthetic 
oligonucleotide derivatives modified by monomeric units. In a therapeutic context, liposome 
mediated delivery of ribozymes improves cellular uptake, as described in Birikh et al, Eur. 
1 5 J. Biochem. ( 1 997) 245: 1 . 

Using the polynucleotide sequences of the invention and methods known in the art, 
ribozymes are designed to specifically bind and cut the corresponding mRNA species. 
Ribozymes thus provide a means to inhibit the expression of any of the proteins encoded by 
the disclosed polynucleotides or their full-length genes. The full-length gene need not be 
20 known in order to design and use specific inhibitory ribozymes. In the case of a 

polynucleotide or full-length cDNA of unknown function, ribozymes corresponding to that 
nucleotide sequence can be tested in vitro for efficacy in cleaving the target transcript. 
Those ribozymes that effect cleavage in vitro are further tested in vivo. The ribozyme can 
also be used to generate an animal model for a disease, as described in Birikh et al., supra. 
An effective ribozyme is used to determine the function of the gene of interest by blocking 
its transcription and detecting a change in the cell. Where the gene is found to be a mediator 
in a disease, an effective ribozyme is designed and delivered in a gene therapy for blocking 
transcription and expression of the gene. 

Therapeutic and functional genomic applications of ribozymes proceed beginning 
with knowledge of a portion of the coding sequence of the gene to be inhibited. Thus, for 
many genes, a partial polynucleotide sequence provides adequate sequence for constructing 
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an effective ribozyme. A target cleavage site is selected in the target sequence, and a 

ribozyme is constructed based on the 5' and 3' nucleotide sequences that flank the cleavage 

site. Retroviral vectors are engineered to express monomeric and multimeric hammerhead 

ribozymes targeting the mRNA of the target coding sequence. These monomeric and 

5 multimeric ribozymes are tested in vitro for an ability to cleave the target mRNA. A cell line 
is stably transduced with the retroviral vectors expressing the ribozymes, and the 
transduction is confirmed by Northern blot analysis and reverse-transcription polymerase 
chain reaction (RT-PCR). The cells are screened for inactivation of the target mRNA by 
such indicators as reduction of expression of disease markers or reduction of the gene 

1 0 product of the target mRNA. 

B. Antisense 

Antisense nucleic acids are designed to specifically bind to RNA, resulting in the 
formation of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse 
transcription or messenger RNA translation. Antisense polynucleotides based on a selected 

1 5 polynucleotide sequence can interfere with expression of the corresponding gene. Antisense 
polynucleotides are typically generated within the cell by expression from antisense 
constructs that contain the antisense strand as the transcribed strand. Antisense 
polynucleotides based on the disclosed polynucleotides will bind and/or interfere with the 
translation of mRNA comprising a sequence complementary to the antisense polynucleotide. 

20 The expression products of control cells and cells treated with the antisense construct are 

compared to detect the protein product of the gene corresponding to the polynucleotide upon 
which the antisense construct is based. The protein is isolated and identified using routine 

biochemical methods. 

One rationale for using antisense methods to determine the function of the gene 

25 corresponding to a disclosed polynucleotide is the biological activity of antisense 

therapeutics. Antisense therapy for a variety of cancers is in clinical phase and has been 
discussed extensively in the literature. Reed reviewed antisense therapy directed at the Bcl-2 
gene in tumors; gene transfer-mediated overexpression of Bcl-2 in tumor cell lines conferred 
resistance to many types of cancer drugs. (Reed, 3.C., N.C.I. (1997)59:988). The potential 

30 for clinical development of antisense inhibitors of ras is discussed by Cowsert, L.M., Anti- 
Cancer Drug Design (1997) 72:359. Additional important antisense targets include 
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leukemia (Geurtz, A.M., Anti-Cancer Drug Design (1997) 72:341); human C^ST' 0 
(Monia, B.P., Anti-Cancer Drug Design (1 997) 72:327); and protein kinase C (McGraw et 
al, Anti-Cancer Drug Design (1997) 72:315. 

Given the extensive background literature and clinical experience in antisense 
therapy, one skilled in the art can use selected polynucleotides of the invention as additional 
potential therapeutics. The choice of polynucleotide can be narrowed by first testing them 
for binding to "hot spot" regions of the genome of cancerous cells. If a polynucleotide is 
identified as binding to a "hot spot", testing the polynucleotide as an antisense compound in 
the corresponding cancer cells clearly is warranted. 

Ogunbiyi et ai, Gastroenterology (1 997) 113(3):16\ describe prognostic use of 
allelic loss in colon cancer; Barks et al., Genes, Chromosomes, and Cancer (1997) 79^:278 
describe increased chromosome copy number detected by FISH in malignant melanoma; 
Nishizake et al., Genes, Chromosomes, and Cancer (1997) 19(4):261 describe genetic ' 
alterations in primary breast cancer and their metastases and direct comparison using 
modified comparative genome hybridization; and Elo et al., Cancer Research (1997) 
57(16)3356 disclose that loss of heterozygosity at 16z24.1-q24.2 is significantly associated 
with metastatic and aggressive behavior of prostate cancer. 
C Dominant Negative Mutations 

As an alternative method for identifying function of the gene corresponding to a 
polynucleotide disclosed herein, dominant negative mutations are readily generated for 
corresponding proteins that are active as homomultimers. A mutant polypeptide will interact 
with wild-type polypeptides (made from the other allele) and form a non-functional 
multimer. Thus, a mutation is in a substrate-binding domain, a catalytic domain, or a 
cellular localization domain. Preferably, the mutant polypeptide will be overproduced. 
Point mutations are made that have such an effect. In addition, fusion of different 
polypeptides of various lengths to the terminus of a protein can yield dominant negative 
mutants. General strategies are available for making dominant negative mutants (see, e g. 
Herskowitz, Nature (mi) 329:219). Such techniques can be used to create loss of function 
mutations, which are useful for determining protein function. 
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V. Construction of Polypeptide s of the Invention and Variants Tnereol 

The polypeptides of the invention include those encoded by the disclosed 
polynucleotides. These polypeptides can also be encoded by nucleic acids that, by virtue of 
the degeneracy of the genetic code, are not identical in sequence to the disclosed 
polynucleotides. Thus, the invention includes within its scope a polypeptide encoded by a 
polynucleotide having the sequence of any one of SEQ ID NOS: 1-844 or a variant thereof. 

In general, the term "polypeptide" as used herein refers to both the full length 
polypeptide encoded by the recited polynucleotide, the polypeptide encoded by the gene 
represented by the recited polynucleotide, as well as portions or fragments thereof. 
"Polypeptides" also includes variants of the naturally occurring proteins, where such 
variants are homologous or substantially similar to the naturally occurring protein, and can 
be of an origin of the same or different species as the naturally occurring protein (e.g., 
human, murine, or some other species that naturally expresses the recited polypeptide, 
usually a mammalian species). In general, variant polypeptides have a sequence that has at 
least about 80%, usually at least about 90%, and more usually at least about 98% sequence 
identity with a differentially expressed polypeptide of the invention, as measured by BLAST 
using the parameters described above. The variant polypeptides can be naturally or non- 
naturally glycosylated, i.e., the polypeptide has a glycosylation pattern that differs from the 
glycosylation pattern found in the corresponding naturally occurring protein. 

The invention also encompasses homologs of the disclosed polypeptides (or 
fragments thereof) where the homologs are isolated from other species, i.e. other animal or 
plant species, where such homologs, usually mammalian species, e.g. rodents, such as mice, 
rats; domestic animals, e.g., horse, cow, dog, cat; and humans. By homolog is meant a 
polypeptide having at least about 35%, usually at least about 40% and more usually at least 
about 60% amino acid sequence identity a particular differentially expressed protein as 
identified above, where sequence identity is determined using the BLAST algorithm, with 
the parameters described supra. 

In general, the polypeptides of the subject invention are provided in a non-naturally 
occurring environment, e.g. are separated from their naturally occurring environment. In 
certain embodiments, the subject protein is present in a composition that is enriched for the 
protein as compared to a control. As such, purified polypeptide is provided, where by 
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purified is meant that the protein is present in a composition that is substantially free of non- 
differentially expressed polypeptides, where by substantially free is meant that less than 
90%, usually less than 60% and more usually less than 50% of the composition is made up 
of non-differentially expressed polypeptides. 

Also within the scope of the invention are variants; variants of polypeptides include 
mutants, fragments, and fusions. Mutants can include amino acid substitutions, additions or 
deletions. The amino acid substitutions can be conservative amino acid substitutions or 
substitutions to eliminate non-essential amino acids, such as to alter a glycosylate site, a 
phosphorylation site or an acetylation site, or to minimize misfolding by substitution or ' 
deletion of one or more cysteine residues that are not necessary for function. Conservative 
amino acid substitutions are those that preserve the general charge, 
hydrophobicity/hydrophilicity, and/or steric bulk of the amino acid substituted. For 
example, substitutions between the following groups are conservative: Gly/Ala, Val/Ile/Leu, 
Asp/Glu, Lys/Arg, Asn/Gln, Ser/Cys, Thr, and Phe/Trp/Tyr. 

Variants can be designed so as to retain biological activity of a particular region of 
the protein {e.g., a functional domain and/or, where the polypeptide is a member of a protein 
family, a region associated with a consensus sequence). In a non-limiting example, Osawa et 
al, Biochem. Mol. Int. (1994) 5*1003, discusses the actin binding region of aprotein from 
several different species. The actin binding regions of the these species are considered 
homologous based on the fact that they have amino acids that fall within "homologous 
residue groups." Homologous residues are judged according to the following groups (using 
single letter amino acid designations): STAG; ILVMF; HRK; DEQN; and FYW. For 
example, and S, a T, an A or a G can be in a position and the function (in this case actin 
binding) is retained. 

Additional guidance on amino acid substitution is available from studies of protein 
evolution. Go et al, Int. J. Peptide Protein Res. (1980) 75:21 1, classified amino acid residue 
sites as interior or exterior depending on their accessibility. More frequent substitution on 
exterior sites was confirmed to be general in eight sets of homologous protein families 
regardless of their biological functions and the presence or absence of a prosthetic group. 
Virtually all types of amino acid residues had higher mutabilities on the exterior than in the 
interior. No correlation between mutability and polarity was observed of amino acid 
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resWuesTn^e interior and exterior, respectively. Amino acid residues were classified into 
one of three groups depending on their polarity: polar (Arg, Lys, His, Gin, Asn, Asp, and 
Glu); weak polar (Ala, Pro, Gly, Thr, and Ser), and nonpolar (Cys, Val, Met, He, Leu, Phe, 
Tyr, and Trp). Amino acid replacements during protein evolution were very conservative: 
5 88% and 76% of them in the interior or exterior, respectively, were within the same group of 
the three. Inter-group replacements are such that weak polar residues are replaced more 
often by nonpolar residues in the interior and more often by polar residues on the exterior. 

Additional guidance for production of polypeptide variants is provided in Querol et 
al, Prot. Eng. (1996) 9:265, which provides general rules for amino acid substitutions to 
10 enhance protein thermostability. New glycosylation sites can be introduced as discussed in 
Olsen and Thomsen, J. Gen. Microbiol. (1991) 137:519. An additional disulfide bridge can 
be introduced, as discussed by Perry and Wetzel, Science (1984) 226:555; Pantoliano et al, 
Biochemistry (1987) 26:2077; Matsumura et al, Nature (1989) 342:291; Nishikawa et al, 
Protein Eng. (1990) 3:443; Takagi et al, J. Biol Chem. (1990) 265:6874; Clarke et al, 
15 Biochemistry (1993) 32:4322; and Wakarchuk et al, Protein Eng. (1994) 7:1379. Metal 
binding sites can be introduced, according to Toma et al, Biochemistry (1991) 30:97, and 
Haezerbrouck et al, Protein Eng. (1993) 6:643. Substitutions with prolines in loops can be 
made according to Masul et al, Appl. Env. Microbiol. (1994) 60:3579; and Hardy et al, 
FEBSLett. 377:89. 

20 Cysteine-depleted muteins are considered variants within the scope of the invention. 

These variants can be constructed according to methods disclosed in U.S. Patent No. 
4,959,314, which discloses substitution of cysteines with other amino acids, and methods 
for assaying biological activity and effect of the substitution. Such methods are suitable for 
proteins according to this invention that have cysteine residues suitable for such 

25 substitutions, for example to eliminate disulfide bond formation. 

Variants also include fragments of the polypeptides disclosed herein, particularly 
biologically active fragments and/or fragments corresponding to functional domains. 
Fragments of interest will typically be at least about 10 aa to at least about 15 aa in length, 
usually at least about 50 aa in length, and can be as long as 300 aa in length or longer, but 
30 will usually not exceed about 1000 aa in length, where the fragment will have a stretch of 
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amino acids that is identical to a polypeptide encoded by a polynucleotide h^™ 98727610 



a 



25 



30 



sequence of any SEQ ID NOS:l-844, or a homolog thereof. 

The protein variants described herein are encoded by polynucleotides that are within 
the scope of the invention. The genetic code can be used to select the appropriate codons to 
construct the corresponding variants. 

VI - Computer-Related EmhnHiWntc 

In general, a library of polynucleotides is a collection of sequence information, which 
information is provided in either biochemical form (e.g., as a collection of polynucleotide 
molecules), or in electronic form (e.g., as a collection of polynucleotide sequences stored in a 
computer-readable form, as in a computer system and/or as part of a computer proeram). 
The sequence information of the polynucleotides can be used in a variety of ways, e.g., as a 
resource for gene discovery, as a representation of sequences expressed in a selected cell 
type (e.g., cell type markers), and/or as markers of a given disease or disease state. In 
general, a disease marker is a representation of a gene product that is present in all affected 
by disease either at an increased or decreased level relative to a normal cell (e.g., a cell of the 
same or similar type that is not substantially affected by disease). For example, a 
polynucleotide sequence in a library can be a polynucleotide that represents an mRNA, 
polypeptide, or other gene product encoded by the polynucleotide, that is either 
overexposed or underexpressed in a breast ductal cell affected by cancer relative to a 
normal (i.e., substantially disease-free) breast cell. 

The nucleotide sequence information of the library can be embodied in any suitable 
form, e.g., electronic or biochemical forms. For example, a library of sequence information 
embodied in electronic form includes an accessible computer data file (or, in biochemical 
form, a collection of nucleic acid molecules) that contains the representative nucleotide 
sequences of genes that are differentially expressed (e.g. , overexposed or underexpressed) 
as between, for example, i) a cancerous cell and a normal cell; ii) a cancerous cell and a 
dysplastic cell; hi) a cancerous cell and a cell affected by a disease or condition other than 
cancer; iv) a metastatic cancerous cell and a normal cell and/or non-metastatic cancerous 
cell; v) a malignant cancerous cell and a non-malignant cancerous cell (or a normal cell) 
and/or vi) a dysplastic cell relative to a normal cell. Other combinations and comparisons of 
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cells affected by various diseases or stages of disease will be readily apparent to the 

ordinarily skilled artisan. Biochemical embodiments of the library include a collection of 
nucleic acids that have the sequences of the genes in the library, where the nucleic acids can 
correspond to the entire gene in the library or to a fragment thereof, as described in greater 
5 detail below. 

The polynucleotide libraries of the subject invention include sequence information of 
a plurality of polynucleotide sequences, where at least one of the polynucleotides has a 
sequence of any of SEQ ID NOS:l-844. By plurality is meant at least 2, usually at least 3 
and can include up to all of SEQ ID NOS:l-844. The length and number of polynucleotides 
10 in the library will vary with the nature of the library, e.g., if the library is an oligonucleotide 
array, a cDNA array, a computer database of the sequence information, etc. 

Where the library is an electronic library, the nucleic acid sequence information can 
be present in a variety of media. "Media" refers to a manufacture, other than an isolated 
nucleic acid molecule, that contains the sequence information of the present invention. Such 
15 a manufacture provides the genome sequence or a subset thereof in a form that can be 

examined by means not directly applicable to the sequence as it exists in a nucleic acid. For 
example, the nucleotide sequence of the present invention, e.g. the nucleic acid sequences of 
any of the polynucleotides of SEQ ID NOS: 1 -844, can be recorded on computer readable 
media, e.g. any medium that can be read and accessed directly by a computer. Such media 
20 include, but are not limited to: magnetic storage media, such as a floppy disc, a hard disc 
storage medium, and a magnetic tape; optical storage media such as CD-ROM; electrical 
storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. One of skill in the art can readily appreciate how any of the 
presently known computer readable mediums can be used to create a manufacture 
25 comprising a recording of the present sequence information. "Recorded" refers to a process 
for storing information on computer readable medium, using any such methods as known in 
the art. Any convenient data storage structure can be chosen, based on the means used to 
access the stored information. A variety of data processor programs and formats can be used 
for storage, e.g. word processing text file, database format, etc. In addition to the sequence 
30 information, electronic versions of the libraries of the invention can be provided in 

conjunction or connection with other computer-readable information and/or other types of 
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computer-readable files (e.g., searchable files, executable files, etc, including, but not limited 
to, for example, search program software, etc.). 

By providing the nucleotide sequence in computer readable form, the information can 
be accessed for a variety of purposes. Computer software to access sequence information is 
publicly available. For example, the BLAST (Altschul et al, supra.) and BLAZE (Brutlag 
et al. Camp. Chem. (1993) 17:203) search algorithms on a Sybase system can be used 
identify open reading frames (ORFs) within the genome that contain homology to ORFs 
from other organisms. 

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware of the computer-based systems of the oresent 
invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A skilled artisan can readily appreciate that any one of the currently 
available computer-based system are suitable for use in the present invention. The data 
storage means can comprise any manufacture comprising a recording of the present sequence 
information as described above, or a memory access means that can access such a 
manufacture. 

"Search means" refers to one or more programs implemented on the computer-based 
system, to compare a target sequence or target structural motif with the stored sequence 
information. Search means are used to identify fragments or regions of the genome that 
match a particular target sequence or target motif. A variety of known algorithms are 
publicly known and commercially available, e.g. MacPattern (EMBL), BLASTN and 
BLASTX (NCBI). A "target sequence" can be any DNA or amino acid sequence of six or 

more nucleotides or two or more amino acids, preferably from about lOto 100 amino acids 
25 or from about 30 to 300 nucleotide residues. 

A "target structural motif," or "target motif," refers to any rationally selected 
sequence or combination of sequences in which the sequences) are chosen based on a 
three-dimensional configuration that is formed upon the folding of the target motif, or on 
consensus sequences of regulatory or active sites. There are a variety of target motifs known 
in the art. Protein target motifs include, but arc not limited to, enzyme active sites and signal 
sequences. Nucleic acid target motifs include, but are not limited to, hairpin structures, 
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promoter sequences and other expression elements such as binding sites for transcripts 

factors. 

A variety of structural formats for the input and output means can be used to input 
and output the information in the computer-based systems of the present invention. One 
5 format for an output means ranks fragments of the genome possessing varying degrees of 
homology to a target sequence or target motif. Such presentation provides a skilled artisan 
with a ranking of sequences and identifies the degree of sequence similarity contained in the 
identified fragment. 

A variety of comparing means can be used to compare a target sequence or target 
10 motif with the data storage means to identify sequence fragments of the genome. A skilled 
artisan can readily recognize that any one of the publicly available homology search 
programs can be used as the search means for the computer based systems of the present 

invention. 

As discussed above, the "library" of the invention also encompasses biochemical 
15 libraries of the polynucleotides of SEQ ID NOS: 1-844, e.g., collections of nucleic acids 
representing the provided polynucleotides. The biochemical libraries can take a variety of 
forms, e.g., a solution of cDNAs, a pattern of probe nucleic acids stably associated with a 
surface of a solid support (i.e., an array) and the like. Of particular interest are nucleic acid 
arrays in which one or more of SEQ ID NOS:l-844 is represented on the array. By array is 
20 meant a an article of manufacture that has at least a substrate with at least two distinct 

nucleic acid targets on one of its surfaces, where the number of distinct nucleic acids can be 
considerably higher, typically being at least 10 nt, usually at least 20 nt and often at least 25 
nt. A variety of different array formats have been developed and are known to those of skill 
in the art, including those described in 5,242,974; 5,384,261 ; 5,405,783; 5,412,087; 
25 5,424,186; 5,429,807; 5,436,327; 5,445,934; 5,472,672; 5,527,681; 5,529,756; 5,545,531; 
5,554,501; 5,556,752; 5,561,071; 5,599,895; 5,624,711; 5,639,603; 5,658,734; WO 
93/17126; WO 95/11995; WO 95/35505; EP 742287; and EP 799897. The arrays of the 
subject invention find use in a variety of applications, including gene expression analysis, 
drug screening, mutation analysis and the like, as disclosed in the above-listed exemplary 
3 0 patent documents. 
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In addition to the above nucleic acid libraries, analogous libraries of polypeptides are 
also provided, where the where the polypeptides of the library will represent at least a 
portion of the polypeptides encoded by SEQ ID NOS: 1-844. 

VII. Utilities 

A - Use of Polynucleotide Probes in Manning, and in Tissue Profiling 
Polynucleotide probes, generally comprising at least 12 contiguous nucleotides of a 
polynucleotide as shown in the Sequence Listing, are used for a variety of purposes, such as 
chromosome mapping of the polynucleotide and detection of transcription levels. Additional 
disclosure about preferred regions of the disclosed polynucleotide sequences is found in the 
Examples. A probe that hybridizes specifically to a polynucleotide disclosed herein should 
provide a detection signal at least 5-, i 0-, or 20-fold higher than the background 
hybridization provided with other unrelated sequences. 

Probes in Detection of Expression T eve], Nucleotide probes are used to detect 
expression of a gene corresponding to the provided polynucleotide. The references describe 
an example of a sandwich nucleotide hybridization assay. For example, in Northern blots, 
mRNA is separated electrophoretically and contacted with a probe. A probe is detected as 
hybridizing to an mRNA species of a particular size. The amount of hybridization is 
quantitated to determine relative amounts of expression, for example under a particular 
condition. Probes are also used to detect products of amplification by polymerase chain 
reaction. The products of the reaction are hybridized to the probe and hybrids are detected. 
Probes are used for in situ hybridization to cells to detect expression. Probes can also be 
used in vivo for diagnostic detection of hybridizing sequences. Probes are typically labeled 
with a radioactive isotope. Other types of detectable labels can be used such as 
chromophores, fluors, and enzymes. Other examples of nucleotide hybridization assays are 
described in WO92/02526 and U.S. Patent No. 5,124,246. 

Alternatively, the Polymerase Chain Reaction (PCR) is another means for detecting 
small amounts of target nucleic acids (see, e.g., Mullis et ah, Meth. Enzymol. (1987) 
755:335; U.S. Patent No. 4,683,195; and U.S. Patent No. 4,683,202). Two primer 
polynucleotides nucleotides hybridize with the target nucleic acids and are used to prime the 
reaction. The primers can be composed of sequence within or 3' and 5' to the 
polynucleotides of the Sequence Listing. Alternatively, if the primers are 3' and 5' to these 
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polynuclSes, they need not hybridize to them or the complements. A thermostable 
polymerase creates copies of target nucleic acids from the primers using the original target 
nucleic acids as a template. After a large amount of target nucleic acids is generated by the 
polymerase, it is detected by methods such as Southern blots. When using the Southern blot 
5 method, the labeled probe will hybridize to a polynucleotide of the Sequence Listing or 
complement. 

Furthermore, mRN A or cDNA can be detected by traditional blotting techniques 
described in Sambrook et al, "Molecular Cloning: A Laboratory Manual" (New York, Cold 
Spring Harbor Laboratory, 1989). mRNA or cDNA generated from mRNA using a 
1 0 polymerase enzyme can be purified and separated using gel electrophoresis. The nucleic 
acids on the gel are then blotted onto a solid support, such as nitrocellulose. The solid 
support is exposed to a labeled probe and then washed to remove any unhybridized probe. 
Next, the duplexes containing the labeled probe are detected. Typically, the probe is labeled 
with radioactivity. 

1 5 Mapping. Polynucleotides of the present invention are used to identify a 

chromosome on which the corresponding gene resides. Such mapping can be useful in 
identifying the function of the polynucleotide-related gene by its proximity to other genes 
with known function. Function can also be assigned to the polynucleotide-related gene when 
particular syndromes or diseases map to the same chromosome. For example, use of 
20 polynucleotide probes in identification and quantification of nucleic acid sequence 
aberrations is described in U.S. Patent No. 5,783,387. 

For example, fluorescence in situ hybridization (FISH) on normal metaphase spreads 
facilitates comparative genomic hybridization to allow total genome assessment of changes 
in relative copy number of DNA sequences. See Schwartz and Samad, Curr. Opin. 
25 Biotechnol. (1994) 8:70; Kallioniemi et al, Sem. Cancer Biol (1993) 4M; Valdes et al, 
Methods in Molecular Biology (1997) 68:1, Boultwood, ed., Human Press, Totowa, NJ. 
Preparations of human metaphase chromosomes are prepared using standard cytogenetic 
techniques from human primary tissues or cell lines. Nucleotide probes comprising at least 
12 contiguous nucleotides selected from the nucleotide sequence shown in the Sequence 
30 Listing are used to identify the corresponding chromosome. The nucleotide probes are 

labeled, for example, with a radioactive, fluorescent, biotinylated, or chemiluminescent label, 
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and detected by well known methods appropriate for the particular label sele'ctefpTtocols 
for hybridizing nucleotide probes to preparations of metaphase chromosomes are also well 
known in the art. A nucleotide probe will hybridize specifically to nucleotide sequences in 
the chromosome preparations that are complementary to the nucleotide sequence of the 
5 probe. 

Polynucleotides are mapped to particular chromosomes using, for example, radiation 
hybrids or chromosome-specific hybrid panels. See Leach ./ al, Advances in Genetics, 
(1995) 53:63-99; Walter et al, Nature Genetics (1994) 7:22; Walter and Goodfellow Trends 
in Genetics (1992) P:352. Panels for radiation hybrid mapping are available from Research 
10 Genetics, Inc., Huntsville, Alabama, USA. Databases for markers using various panels are 
avaUable via the world wide web at http:/F/shgc-www.stanford.edu; and httn://www. 
genome.wi.mit edu/rpi-bin/comi P /rhm,p p.r P |. The statistical program RHMAP can be used 
to construct a map based on the data from radiation hybridization with a measure of the 
relative likelihood of one order versus another. RHMAP is available via the world wide web 
1 5 at http://www.sph.umich.edu/group/statgen/software. 

In addition, commercial programs are available for identifying regions of 
chromosomes commonly associated with disease, such as cancer. Polynucleotides based on 
the polynucleotides of the invention can be used to probe these regions. For example, if 
through profile searching a provided polynucleotide is identified as corresponding to a gene 
encoding a kinase, its ability to bind to a cancer-related chromosomal region will suggest its 
role as a kinase in one or more stages of tumor cell development/growth. Although some 
experimentation would be required to elucidate the role, the polynucleotide constitutes a new 
material for isolating a specific protein that has potential for developing a cancer diagnostic 
or therapeutic. 

Tissue Typinp or Profllinp Expression of specific mRNA corresponding to the 
provided polynucleotides can vary in different cell types and can be tissue-specific This 
variation of mRNA levels in different cell types can be exploited with nucleic acid probe 
assays to determine tissue types. For example, PGR, branched DNA probe assays, or 
blotting techniques utilizing nucleic acid probes substantially identical or complementary to 
polynucleotides listed in the Sequence Listing can determine the presence or absence of the 
corresponding cDNA or mRNA. 
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WO Tor^xampte, a metastatic lesion is identified by its developmental organ or tissue 
source by identifying the expression of a particular marker of that organ or tissue. If a 
polynucleotide is expressed only in a specific tissue type, and a metastatic lesion is found to 
express that polynucleotide, then the developmental source of the lesion has been identified. 
5 Expression of a particular polynucleotide is assayed by detection of either the corresponding 
mRNA or the protein product. Immunological methods, such as antibody staining, are used 
to detect a particular protein product. Hybridization methods can be used to detect particular 
mRNA species, including but not limited to in situ hybridization and Northern blotting. 

iTcnfFnlvmorphisms. A polynucleotide of the invention will be useful in forensics, 
10 genetic analysis, mapping, and diagnostic applications if the corresponding region of a gene 
is polymorphic in the human population. Particular polymorphic forms of the provided 
polynucleotides can be used to either identify a sample as deriving from a suspect or rule out 
the possibility that the sample derives from the suspect. Any means for detecting a 
polymorphism in a gene are used, including but not limited to electrophoresis of protein 
15 polymorphic variants, differential sensitivity to restriction enzyme cleavage, and 
hybridization to allele-specific probes. 
B. Antibody Production 

Expression products of a polynucleotide of the invention, the corresponding mRNA 
or cDNA, or the corresponding complete gene are prepared and used for raising antibodies 

20 for experimental, diagnostic, and therapeutic purposes. For polynucleotides to which a 

corresponding gene has not been assigned, this provides an additional method of identifying 
the corresponding gene. The polynucleotide or related cDNA is expressed as described 
above, and antibodies are prepared. These antibodies are specific to an epitope on the 
polypeptide encoded by the polynucleotide, and can precipitate or bind to the corresponding 

25 native protein in a cell or tissue preparation or in a cell-free extract of an in vitro expression 
system. 

Immunogens for raising antibodies are prepared by mixing the polypeptides encoded 
by the polynucleotides of the present invention with adjuvants. Alternatively, polypeptides 
are made as fusion proteins to larger immunogenic proteins. Polypeptides are also 
30 covalently linked to other larger immunogenic proteins, such as keyhole limpet hemocyanin. 
Immunogens are typically administered intradermal^, subcutaneously, or intramuscularly. 
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Immunogens are administered to experimental animals such as rabbits, sheep, and mice, to 
generate antibodies. Optionally, the animal spleen cells are isolated and fused with myeloma 
cells to form hybridomas which secrete monoclonal antibodies. Such methods are well 
known in the art. According to another method known in the art, the selected polynucleotide 
i is administered directly, such as by intramuscular injection, and expressed in vivo. The 
expressed protein generates a variety of protein-specific immune responses, including 
production of antibodies, comparable to administration of the protein. 

Preparations of polyclonal and monoclonal antibodies specific for polypeptides 
encoded by a selected polynucleotide are made using standard methods known in the art. 
The antibodies specifically bind to epitopes present in the polypeptides encoded by 
polynucleotides disclosed in the Sequence Listing. Typically, at least 6, 8. 10. or 12 
contiguous amino acids are required to form an epitope. However, epitopes which involve 
non-contiguous amino acids may require more, for example at least 15, 25, or 50 amino 
acids. A short sequence of a polynucleotide may then be unsuitable for use as an epitope to 
raise antibodies for identifying the corresponding novel protein, because of the potential for 
cross-reactivity with a known protein. However, the antibodies can be useful for other 
purposes, particularly if they identify common structural features of a known protein and a 
novel polypeptide encoded by a polynucleotide of the invention. 

Antibodies that specifically bind to human polypeptides encoded by the provided 
polypeptides should provide a detection signal at least 5-, 1 0-, or 20-fold higher than a 
detection signal provided with other proteins when used in Western blots or other 
immunochemical assays. Preferably, antibodies that specifically polypeptides of the 
invention do not bind to other proteins in immunochemical assays at detectable levels and 
can immunoprecipitate the specific polypeptide from solution. 

To test for the presence of serum antibodies to the polypeptide of the invention 
human population, human antibodies are purified by methods well known in the 
Preferably, the antibodies are affinity purified by passing antiserum over a column to which 
the corresponding selected polypeptide or fusion protein is bound. The bound antibodies , 
then be eluted from the column, for example using a buffer with a high salt concentration. 
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W ° "^addition to the antibodies discussed above, genetically engineered antibody 
derivatives are made, such as single chain antibodies, according to methods well known in 
the art. 

C. T Tsr of Polynucleotides to Construct Arrays for Diagnostics 
Polynucleotide arrays provide a high throughput technique that can assay a large 
number of polynucleotide sequences in a sample. This technology can be used as a 
diagnostic and as a tool to test for differential expression to determine function of an 
encoded protein. Arrays can be created by spotting polynucleotide probes onto a substrate 
{e.g., glass, nitrocelllose, etc.) in a two-dimensional matrix or array having bound probes. 
The probes can be bound to the substrate by either covalent bonds or by non-specific 
interactions, such as hydrophobic interactions. Samples of polynucleotides can be detectably 
labeled {e.g., using radioactive or fluorescent labels) and then hybridized to the probes. 
Double stranded polynucleotides, comprising the labeled.sample polynucleotides bound to 
probe polynucleotides, can be detected once the unbound portion of the sample is washed 
away. Techniques for constructing arrays and methods of using these arrays are described in 
EP No. 0 799 897; PCT No. WO 97/29212; PCT No. WO 97/27317; EP No. 0 785 280; PCT 
No. WO 97/02357; U.S. Pat. No. 5,593,839; U.S. Pat. No. 5,578,832; EP No. 0 728 520; 
U.S. Pat. No. 5,599,695; EP No. 0 721 016; U.S. Pat. No. 5,556,752; PCT No. WO 
95/22058; and U.S. Pat. No. 5,631,734. 

As discussed in some detail above, arrays can be used to examine differential 
expression of genes and can be used to determine gene function. For example, arrays of the 
instant polynucleotide sequences can be used to determine if any of the provided 
polynucleotides are differentially expressed between a test cell and control cell {e.g., cancer 
cells and normal cells). For example, high expression of a particular message in a cancer 
cell, which is not observed in a corresponding normal cell, can indicate a cancer specific 
protein. Exemplary uses of arrays are further described in, for example, Pappalarado et ai, 
Sem. Radiation Oncol. (1998) 5:217; and Ramsay Nature Biotechnol. (1998) 7(5:40. 
D. Differential Expression 

The polynucleotides of the invention can also be used to detect differences in 
expression levels between two cells, e.g., as a method to identify abnormal or diseased tissue 
in a human. For polynucleotides corresponding to profiles of protein families as described 
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above, the choice of tissue can be selected according to the putative biologica^ZcuoTln 
general, the expression of a gene corresponding to a specific polynucleotide is compared 
between a first tissue that is suspected of being diseased and a second, normal tissue of the 
human. The tissue suspected of being abnormal or diseased can be derived from a different 
: tissue type of the human, but preferably it is derived from the same tissue type; for example 
an intestinal polyp or other abnormal growth should be compared with normal intestinal 
tissue. The normal tissue can be the same tissue as that of the test sample, or any normal 
tissue of the patient, especially those that express the polynucleotide-related gene of interest 
(e.g., brain, thymus, testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle, 
pancreas, and the mucosal lining of the colon). A difference between the polynucleotide- 
related gene, mRNA, or protein in the two tissues which are compared, for example in 
molecular weight, amino acid or nucleotide sequence, or relative abundance, indicates a 
change in the gene, or a gene which regulates it, in the tissue of the human that was 
suspected of being diseased. Examples of detection of differential expression and its use in 
diagnosis of cancer are described in U.S. Patent Nos. 5,688,641 and 5,677,125. 

The polynucleotide-related genes in the two tissues are compared by any means 
known in the art. For example, the two genes can be sequenced, and the sequence of the 
gene in the tissue suspected of being diseased compared with the gene sequence in the 
normal tissue. The genes corresponding to a provided polynucleotide, or portions thereof, in 
the two tissues are amplified, for example using nucleotide primers based on the nucleotide 
sequence shown in the Sequence Listing, using the polymerase chain reaction. The 
amplified genes or portions of genes are hybridized to detectably labeled nucleotide probes 
selected from a nucleotide sequence shown in the Sequence Listing. A difference in the 
nucleotide sequence of me isolated gene in the tissue suspected of being diseased compared 
with the normal nucleotide sequence suggests a role of the gene product encoded by the 
subject polynucleotide in the disease, and provides guidance for preparing a therapeutic 
agent. 

Alternatively, mRNA corresponding to a provided polynucleotide in the two tissues 
is compared. PolyA'RNA is isolated from the two tissues as is known in the art. For 
example, one of skill in the art can readily determine differences in the size or amount of 
mRNA transcripts between the two tissues using Northern blots and detectably labeled 
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nll^deprobes selected from the nucleotide sequence shown in the Sequence Listing. 
Increased or decreased expression of a given mRNA in a tissue sample suspected of being 
diseased, compared with the expression of the same mRNA in a normal tissue, suggests that 
the expressed protein has a role in the disease, and also provides a lead for preparing a 

5 therapeutic agent. 

The comparison can also be accomplished by analyzing polypeptides between the 
matched samples. The sizes of the proteins in the two tissues are compared, for example, 
using antibodies of the present invention to detect polypeptides in Western blots of protein 
extracts from the two tissues. Other changes, such as expression levels and subcellular 

10 localization, can also be detected immunologically, using antibodies to the corresponding 

protein. A higher or lower level of expression of a given polypeptide in a tissue suspected of 
being diseased, compared with the same protein expression level in a normal tissue, is 
indicative that the expressed protein has a role in the disease, and provides guidance for 

preparing a therapeutic agent. 

15 Similarly, comparison of polynucleotide sequences or of gene expression products, 

e.g., mRNA and protein, between a human tissue that is suspected of being diseased and a 
normal tissue of a human, are used to follow disease progression or remission in the human. 
Such comparisons are made as described above. For example, increased or decreased 
expression of a gene corresponding to an inventive polynucleotide in the tissue suspected of 

20 being neoplastic can indicate the presence of neoplastic cells in the tissue. The degree of 
increased expression of a given gene in the neoplastic tissue relative to expression of the 
same gene in normal tissue, or differences in the amount of increased expression of a given 
gene in the neoplastic tissue over time, is used to assess the progression of the neoplasia in 
that tissue or to monitor the response of the neoplastic tissue to a therapeutic protocol over 

25 time. 

The expression pattern of any two cell types can be compared, such as low and high 
metastatic tumor cell lines, malignant or non-malignant cells, or cells from tissue which have 
and have not been exposed to a therapeutic agent. A genetic predisposition to disease in a 
human is detected by comparing expression levels of an mRNA or protein corresponding to 
30 a polynucleotide of the invention in a fetal tissue with levels associated in normal fetal 
tissue. Fetal tissues that are used for this purpose include, but are not limited to, amniotic 
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fluid, chononic villi, blood, and the blastomere of an in vitro-fertilized embryo. The 

comparable normal polynucleotide-related gene is obtained from any tissue. The mRNA or 
protein is obtained from a normal tissue of a human in which the polynucleotide-related gene 
is expressed. Differences such as alterations in the nucleotide sequence or size of the same 
5 product of the fetal polynucleotide-related gene or mRNA, or alterations in the molecular 
weight, amino acid sequence, or relative abundance of fetal protein, can indicate a germline 
mutation in the polynucleotide-related gene of the fetus, which indicates a genetic 
predisposition to disease. Particular diagnostic and prognostic uses of the disclosed 
polynucleotides are described in more detail below. 
10 E Diagnostic, Propnostic, and Other Uses Based On r>;ffl> rentia | Fynrp<;g;nn 

In general, diagnostic methods of the invention for involve detection of a level or 
amount of a gene product, particularly a differentially expressed gene product, in a test 
sample obtained from a patient suspected of having or being susceptible to a disease (e.g., 
breast cancer, lung cancer, colon cancer and/or metastatic forms thereof), and comparing the 
detected levels to those levels found in normal cells (e.g., cells substantially unaffected by 
cancer) and/or other control cells (e.g., to differentiate a cancerous cell from a cell affected 
by dysplasia). Furthermore, the severity of the disease can be assessed by comparing the 
detected levels of a differentially expressed gene product with those levels detected in 
samples representing the levels of differentially gene product associated with varying 
20 degrees of severity of disease. 

The term "differentially expressed gene" is intended to encompass a polynucleotide 
that can, for example, include an open reading frame encoding a gene product (e.g., a 
polypeptide), and/or introns of such genes and adjacent 5' and 3' non-coding nucleotide 
sequences involved in the regulation of expression, up to about 20 kb beyond the coding 
region, but possibly further in either direction. The gene can be introduced into an 
appropriate vector for extrachromosomal maintenance or for integration into a host genome. 
In general, a difference in expression level associated with a decrease in expression level of 
at least about 25%, usually at least about 50% to 75%, more usually at least about 90% or 
more is indicative of a differentially expressed gene of interest, i.e., a gene that is 
underexpressed or down-regulated in the test sample relative to a control sample. 
Furthermore, a difference in expression level associated with an increase in expression of at 
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le^alout 9 25%, usually at least about 50% to 75%, more usually at least about 90% and can 
be at least about 1 K-fold, usually at least about 2-fold to about 10-fold, and can be about 
100-fold to about 1 ,000-fold increase relative to a control sample is indicative of a 
differentially expressed gene of interest, i.e., an overexpressed or up-regulated gene. 
5 "Differentially expressed polynucleotide" as used herein means a nucleic acid 

molecule (RNA or DNA) having a sequence that represents a differentially expressed gene, 
e.g., the differentially expressed polynucleotide comprises a sequence (e.g., an open reading 
frame encoding a gene product) that uniquely identifies a differentially expressed gene so 
that detection of the differentially expressed polynucleotide in a sample is correlated with the 
10 presence of a differentially expressed gene in a sample. "Differentially expressed 

polynucleotides" is also meant to encompass fragments of the disclosed polynucleotides, 
e.g., fragments retaining biological activity, as well as nucleic acids homologous, 
substantially similar, or substantially identical (e.g., having about 90% sequence identity) to 
the disclosed polynucleotides. 
15 Methods of the subject invention useful in diagnosis or prognosis typically involve 

comparison of the abundance of a selected differentially expressed gene product in a sample 
of interest with that of a control to determine any relative differences in the expression of the 
gene product, where the difference can be measured qualitatively and/or quantitatively. 
Quantitation can be accomplished, for example, by comparing the level of expression 
20 product detected in the sample with the amounts of product present in a standard curve. A 
comparison can be made visually; by using a technique such as densitometry, with or 
without computerized assistance; by preparing a representative library of cDNA clones of 
mRN A isolated from a test sample, sequencing the clones in the library to determine that 
number of cDNA clones corresponding to the same gene product, and analyzing the number 
25 of clones corresponding to that same gene product relative to the number of clones of the 
same gene product in a control sample; or by using an array to detect relative levels of 
hybridization to a selected sequence or set of sequences, and comparing the hybridization 
pattern to that of a control. The differences in expression are then correlated with the 
presence or absence of an abnormal expression pattern. A variety of different methods for 
30 determining the nucleic acid abundance in a sample are known to those of skill in the art, 
where particular methods of interest include those described in: Pietu et al. Genome Res. 



51 



10 



15 



20 



25 



30 



WO 99/33982 

noo£\<^ 7 , , „ PCT/US98/27610 

(1996) 6:492; Zhao et ai, Gene (1995) 755:207; Scares , Curr. Opin. Biotechnol. (1977) 8: 

542; Raval, J. Pharmacol Toxicol Methods (1994) 52:125; Chalifour et al, Anal. Biochem 
(1994) 216:299; Stolz ef a/., Mol Biotechnol (1996) 5:225; Hong et al.,Biosci. Reports 
(1982) 2:907; and McGraw, Anal. Biochem. (1984) 745:298. Also of interest are the 
methods disclosed in WO 97/273 17, the disclosure of which is herein incorporated by 
reference. 

In general, diagnostic assays of the invention involve detection of a gene product of a 
the polynucleotide sequence {e.g. , mRNA or polypeptide) that corresponds to a sequence of 
SEQ ID NOS: 1-844. The patient from whom the sample is obtained can be apparently 
healthy, susceptible to disease (e.g., as determined by family history or exposure to certain 
environmental factors), or can already be identified as having a condition in which altered 
expression of a gene product of the invention is implicated. 

In the assays of the invention, the diagnosis can be determined based on detected 
gene product expression levels of a gene product encoded by at least one, preferably at least 
two or more, at least 3 or more, or at least 4 or more of the polynucleotides having a 
sequence set forth in SEQ ID NOS.1-844, and can involve detection of expression of genes 
corresponding to all of SEQ ID NOS:l-844 and/or additional sequences that can serve as 
additional diagnostic markers and/or reference sequences. Where the diagnostic method is 
designed to detect the presence or susceptibility of a patient to cancer, the assay preferably 
involves detection of a gene product encoded by a gene corresponding to a polynucleotide 
that is differentially expressed in cancer. For example, a higher level of expression of a 
polynucleotide corresponding to SEQ ID NO:52 relative to a level associated with a normal 
sample can indicate the presence of cancer in the patient from whom the sample is derived. 
In another example, detection of a lower level of a polynucleotide corresponding to SEQ ID 
NO:39 relative to a normal level is indicative of the presence of cancer in the patient. 
Further examples of such differentially expressed polynucleotides are described in the 
Examples below. Given the provided polynucleotides and information regarding their 
relative expression levels provided herein, assays using such polynucleotides and detection 
of their expression levels in diagnosis and prognosis will be readily apparent to the ordinarily 
skilled artisan. 
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Any of a variety of detectable labels can be used in connection with the various 
embodiments of the diagnostic methods of the invention. Suitable detectable labels include 
fluorochromes,(e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, 
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 2',7'-dimethoxy-4%5'- 
5 dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 6-carboxy- 

2',4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N\N'- 
tetramethyl-6-carboxyrhodamine (TAMRA)), radioactive labels, (e.g. 32 P, 3S S, 3 H, etc.), and 
the like. The detectable label can involve a two stage systems (e.g., biotin-avidin, hapten- 

anti-hapten antibody, etc.) 
10 Reagents specific for the polynucleotides and polypeptides of the invention, such as 

antibodies and nucleotide probes, can be supplied in a kit for detecting the presence of an 
expression product in a biological sample. The kit can also contain buffers or labeling 
components, as well as instructions for using the reagents to detect and quantify expression 
products in the biological sample. Exemplary embodiments of the diagnostic methods of the 
1 5 invention are described below in more detail. 

Polypeptide detprtinn in diagnosis. In one embodiment, the test sample is assayed 
for the level of a differentially expressed polypeptide. Diagnosis can be accomplished using 
any of a number of methods to determine the absence or presence or altered amounts of the 
differentially expressed polypeptide in the test sample. For example, detection can utilize 
20 staining of cells or histological sections with labeled antibodies, performed in accordance 
with conventional methods. Cells can be permeabilized to stain cytoplasmic molecules. In 
general, antibodies that specifically bind a differentially expressed polypeptide of the 
invention are added to a sample, and incubated for a period of time sufficient to allow 
binding to the epitope, usually at least about 10 minutes. The antibody can be detectably 
25 labeled for direct detection (e.g. , using radioisotopes, enzymes, fluorescers, 

chemiluminescers, and the like), or can be used in conjunction with a second stage antibody 
or reagent to detect binding (e.g., biotin with horseradish peroxidase-conjugated avidin, a 
secondary antibody conjugated to a fluorescent compound, e.g. fluorescein, rhodamine, 
Texas red, etc.). The absence or presence of antibody binding can be determined by various 
30 methods, including flow cytometry of dissociated cells, microscopy, radiography, 

scintillation counting, etc. Any suitable alternative methods can of qualitative or quantitative 
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detection of levels or amounts of differentially expressed polypeptide can be used, for 

example ELISA, western blot, immunoprecipitation, radioimmunoassay, etc. 

In general, the detected level of differentially expressed polypeptide in the test 

sample is compared to a level of the differentially expressed gene product in a reference or 

control sample, e.g., in a normal cell (negative control) or in a cell having a known disease 

state (positive control). For example, a higher level of expression of a polypeptide encoded 

by SEQ ID NO:52 relative to a level associated with a normal sample can indicate the 

presence of cancer in the patient from whom the sample is derived. In another example, 

detection of a lower level of the polypeptide encoded by SEQ ID NO:39 relative to a normal 

level is indicative of the presence of cancer in the patient. 

mRNA detection The diagnostic methods of the invention can also or alternatively 

involve detection of mPMA 

WJ „ gclIC tuucS p 0naing lo a aitierentially expressed 

polynucleotides of the invention. Any suitable qualitative or quantitative methods known in 
the art for detecting specific mRNAs can be used. mRNA can be detected by, for example, 
in situ hybridization in tissue sections, by reverse transcriptase-PCR, or in Northern blots ' 
containing poly A+ mRNA. One of skill in the art can readily use these methods to 
determine differences in the size or amount of mRNA transcripts between two samples. For 
example, the level of mRNA of the invention in a tissue sample suspected of being 
cancerous or dysplastic is compared with the expression of the mRNA in a reference sample, 
e.g., a positive or negative control sample (e.g., normal tissue, cancerous tissue, etc.). In a ' 
specific non-limiting example, a higher level of mRNA corresponding to SEQ ID NO:52 
relative to a level associated with a normal sample can indicate the presence of cancer in the 
patient from whom the sample is derived. In another example, detection of a lower level of 
mRNA corresponding to SEQ ID NO.-39 relative to a normal level is indicative of the 
25 presence of cancer in the patient. 

Any suitable method for detecting and comparing mRNA expression levels in a 
sample can be used in connection with the diagnostic methods of the invention (see, e.g., 
U.S. 5,804,382). For example, mRNA expression levels in a sample can be determined by 
generation of a library of expressed sequence tags (ESTs) from the sample, where the EST 
library is representative of sequences present in the sample (Adams, et al., (1991) Science 
252:1651). Enumeration of the relative representation of ESTs within the library can be used 
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to approximate the relative representation of the gene transcript within the starting sample. 
The results of EST analysis of a test sample can then be compared to EST analysis of a 
reference sample to determine the relative expression levels of a selected polynucleotide, 
particularly a polynucleotide corresponding to one or more of the differentially expressed 

5 genes described herein. 

Alternatively, gene expression in a test sample can be performed using serial analysis 
of gene expression (SAGE) methodology (Velculescu et al., Science (1995) 270:484). In 
short, SAGE involves the isolation of short unique sequence tags from a specific location 
within each transcript (e.g., a sequence of any one of SEQ ID NOS:l-6). The sequence tags 
10 are concatenated, cloned, and sequenced. The frequency of particular transcripts within the 
starting sample is reflected by the number of times the associated sequence tag is 
encountered with the sequence population. 

Gene expression in a test sample can also be analyzed using differential display (DD) 
methodology. In DD, fragments defined by specific sequence delimiters (e.g., restriction 
15 enzyme sites) are used as unique identifiers of genes, coupled with information about 

fragment length or fragment location within the expressed gene. The relative representation 
of an expressed gene with a sample can then be estimated based on the relative 
representation of the fragment associated with that gene within the pool of all possible 
fragments. Methods and compositions for carrying out DD are well known in the art, see, 
20 e.g., U.S. 5,776,683; and U.S. 5,807,680. 

Alternatively, gene expression in a sample using hybridization analysis, which is 
based on the specificity of nucleotide interactions. Oligonucleotides or cDNA can be used to 
selectively identify or capture DNA or RN A of specific sequence composition, and the 
amount of RNA or cDNA hybridized to a known capture sequence determined qualitatively 
25 or quantitatively, to provide information about the relative representation of a particular 
message within the pool of cellular messages in a sample. Hybridization analysis can be 
designed to allow for concurrent screening of the relative expression of hundreds to 
thousands of genes by using, for example, array-based technologies having high density 
formats, including filters, microscope slides, or microchips, or solution-based technologies 
30 that use spectroscopic analysis (e.g., mass spectrometry). One exemplary use of arrays in the 
diagnostic methods of the invention is described below in more detail. 
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Use of a single gene in diagnostic applications The diagnostic methods of the 
invention can focus on the expression of a single differentially expressed gene. For example, 
the diagnostic method can involve detecting a differentially expressed gene, or a 
polymorphism of such a gene (e.g., a polymorphism in an coding region or control region), 
that is associated with disease. Disease-associated polymorphisms can include deletion or 
truncation of the gene, mutations that alter expression level and/or affect activity of the 
encoded protein, etc. 

Changes in the promoter or enhancer sequence that affect expression levels of an 
differentially gene can be compared to expression levels of the normal allele by various 
methods known in the art. Methods for determining promoter or enhancer strength include 
quantitation of the expressed natural protein; insertion of the variant control element into a 

" «s p-gaiaciosioase, lucirerase, chloramphenicol 

acetyltransferase, etc. that provides for convenient quantitation; and the like. 

A number of methods are available for analyzing nucleic acids for the presence of a 
specific sequence, e.g. a disease associated polymorphism. Where large amounts of DNA 
are available, genomic DNA is used directly. Alternatively, the region of interest is cloned 
into a suitable vector and grown in sufficient quantity for analysis. Cells that express a 
differentially expressed gene can be used as a source of mRNA, which can be assayed 
directly or reverse transcribed into cDNA for analysis. The nucleic acid can be amplified by 
conventional techniques, such as the polymerase chain reaction (PCR), to provide sufficient 
amounts for analysis, and a detectable label can be included in the amplification reaction 
(e.g., using a detectably labeled primer or detectably labeled oligonucleotides) to facilitate 
detection. The use of the polymerase chain reaction is described in Saiki, et al, Science 
(1985) 259:487, and a review of techniques can be found in Sambrook, et al, Molecular 
Cloning: A Laboratory Manual, (1989) pp. 14.2. Alternatively, various methods are known 
in the art that utilize oligonucleotide ligation as a means of detecting polymorphisms, for 
examples see Riley et al, Nucl. Acids Res. (1 990) 75:2887; and Delahunty et al, Am. J. 
Hum. Genet. (1996)55:1239. 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 
number of methods known in the art. The nucleic acid can be sequenced by dideoxy or other 
methods, and the sequence of bases compared to a selected sequence, e.g., to a wild-type 
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sequence. Hybridization with the polymorphic or variant sequence can also be used to 

determine its presence in a sample (e.g., by Southern blot, dot blot, etc.). The hybridization 

pattern of a polymorphic or variant sequence and a control sequence to an array of 

oligonucleotide probes immobilized on a solid support, as described in US 5,445,934, or in 

5 WO 95/35505, can also be used as a means of identifying polymorphic or variant sequences 
associated with disease. Single strand conformational polymorphism (SSCP) analysis, 
denaturing gradient gel electrophoresis (DGGE), and heteroduplex analysis in gel matrices 
are used to detect conformational changes created by DNA sequence variation as alterations 
in electrophoretic mobility. Alternatively, where a polymorphism creates or destroys a 

1 0 recognition site for a restriction endonuclease, the sample is digested with that endonuclease, 
and the products size fractionated to determine whether the fragment was digested. 
Fractionation is performed by gel or capillary electrophoresis, particularly acrylamide or 
agarose gels. 

Screening for mutations in an differentially expressed gene can be based on the 
15 factional or antigenic characteristics of the protein. Protein truncation assays are useful in 
detecting deletions that can affect the biological activity of the protein. Various 
immunoassays designed to detect polymorphisms in proteins can be used in screening. 
Where many diverse genetic mutations lead to a particular disease phenotype, functional 
protein assays have proven to be effective screening tools. The activity of the encoded 
20 protein can be determined by comparison with the wild-type protein. 

p a tt.m notching in ^pn^i* „sin g arrays. In another embodiment, the diagnostic 
and/or prognostic methods of the invention involve detection of expression of a selected set 
of genes in a test sample to produce a test expression pattern (TEP). The TEP is compared to 
a reference expression pattern (REP), which is generated by detection of expression of the 
25 selected set of genes in a reference sample (e.g. , a positive or negative control sample). The 
selected set of genes includes at least one of the genes of the invention, which genes 
correspond to the polynucleotide sequences of SEQ ID NOS: 1 -844. Of particular interest is 
a selected set of genes that includes gene differentially expressed in the disease for which the 

test sample is to be screened. 
30 "Reference sequences" or "reference polynucleotides" as used herein in the context of 

differential gene expression analysis and diagnosis/prognosis refers to a selected set of 
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polynucleotides, which selected set includes at least one or more of the differentially 

expressed polynucleotides described herein. A plurality of reference sequences, preferably 
comprising positive and negative control sequences, can be included as reference sequences. 
Additional suitable reference sequences are found in Genbank, Unigene, and other 
nucleotide sequence databases (including, e.g., expressed sequence tag (EST), partial, and 
full-length sequences). 

"Reference array" means an array having reference sequences for use in hybridization 
with a sample, where the reference sequences include all, at least one of, or any subset of the 
differentially expressed polynucleotides described herein. Usually such an array will include 
at least 3 different reference sequences, and can include any one or all of the provided 
differentially expressed sequences. Arrays of interest can further comprise sequences, 
including polymorphisms, of other genetic sequences, particularly other sequences of interest 
for screening for a disease or disorder {e.g. , cancer, dysplasia, or other related or unrelated 
diseases, disorders, or conditions). The oligonucleotide sequence on the array will usually 
be at least about 12 nt in length, and can be of about the length of the provided sequences, or 
can extend into the flanking regions to generate fragments of 100 nt to 200 nt in length or 
more. 

A "reference expression pattern" or "REP" as used herein refers to the relative levels 
of expression of a selected set of genes, particularly of differentially expressed genes, that is 
associated with a selected cell type, e.g., a normal cell, a cancerous cell, a cell exposed to an 
environmental stimulus, and the like. A "test expression pattern" or "TEP" refers to relative 
levels of expression of a selected set of genes, particularly of differentially expressed genes, 
in a test sample {e.g., a cell of unknown or suspected disease state, from which mRNA is 
isolated). 

"Diagnosis" as used herein generally includes determination of a subject's 
susceptibility to a disease or disorder, determination as to whether a subject is presently 
affected by a disease or disorder, as well as to the prognosis of a subject affected by a disease 
or disorder {e.g., identification of pre-metastatic or metastatic cancerous states, stages of 
cancer, or responsiveness of cancer to therapy). The present invention particularly 
encompasses diagnosis of subjects in the context of breast cancer {e.g., carcinoma in situ 
{e.g., ductal carcinoma in situ), estrogen receptor (ER)-positive breast cancer, ER-negative 
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breast cancer, or other forms and/or stages of breast cancer), lung cancer (e.g., small cell 

carcinoma, non-small cell carcinoma, mesothelioma, and other forms and/or stages of lung 

cancer), and colon cancer (e.g. , adenomatous polyp, colorectal carcinoma, and other forms 

and/or stages of colon cancer). 
5 "Sample" or "biological sample" as used throughout here are generally meant to refer 

to samples of biological fluids or tissues, particularly samples obtained from tissues, 
especially from cells of the type associated with the disease for which the diagnostic 
application is designed (e.g., ductal adenocarcinoma), and the like. "Samples" is also meant 
to encompass derivatives and fractions of such samples (e.g., cell lysates). Where the sample 
10 is solid tissue, the cells of the tissue can be dissociated or tissue sections can be analyzed. 

REPs can be generated in a variety of ways according to methods well known in the 
art. For example, REPs can be generated by hybridizing a control sample to an array having 
a selected set of polynucleotides (particularly a selected set of differentially expressed 
polynucleotides), acquiring the hybridization data from the array, and storing the data in a 
15 format that allows for ready comparison of the REP with a TEP. Alternatively, all expressed 
sequences in a control sample can be isolated and sequenced, e.g., by isolating mRNA from 
a control sample, converting the mRNA into cDNA, and sequencing the cDNA. The 
resulting sequence information roughly or precisely reflects the identity and relative number 
of expressed sequences in the sample. The sequence information can then be stored in a 
20 format (e.g., a computer-readable format) that allows for ready comparison of the REP with 
a TEP. The REP can be normalized prior to or after data storage, and/or can be processed to 
selectively remove sequences of expressed genes that are of less interest or that might 
complicate analysis (e.g., some or all of the sequences associated with housekeeping genes 
can be eliminated from REP data). 
25 TEPs can be generated in a manner similar to REPs, e.g. , by hybridizing a test sample 

to an array having a selected set of polynucleotides, particularly a selected set of 
differentially expressed polynucleotides, acquiring the hybridization data from the array, and 
storing the data in a format that allows for ready comparison of the TEP with a REP. The 
REP and TEP to be used in a comparison can be generated simultaneously, or the TEP can 
30 be compared to previously generated and stored REPs. 
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In one embodiment of the invention, comparison of a TEP with a REP involves 
hybridizing a test sample with a reference array, where the reference array has one or more 
reference sequences for use in hybridization with a sample. The reference sequences include 
all, at least one of, or any subset of the differentially expressed polynucleotides described 
herein. Hybridization data for the test sample is acquired, the data normalized, and the 
produced TEP compared with a REP generated using an array having the same or similar 
selected set of differentially expressed polynucleotides. Probes that correspond to sequences 
differentially expressed between the two samples will show decreased or increased 
hybridization efficiency for one of the samples relative to the other. 

Reference arrays can be produced according to any suitable methods known in the 
art. For example, methods of producing large arrays of oligonucleotides are described in 
U.S. 5,134,854, and U.S. 5,445,934 using light-directed synthesis techniques. Using a 
computer controlled system, a heterogeneous array of monomers is converted, through 
simultaneous coupling at a number of reaction sites, into a heterogeneous array of polymers. 
Alternatively, microarrays are generated by deposition of pre-synthesized oligonucleotides 
onto a solid substrate, for example as described in PCT published application no. 
WO 95/35505. 

Methods for collection of data from hybridization of samples with a reference arrays 
are also well known in the art. For example, the polynucleotides of the reference and test 
20 samples can be generated using a detectable fluorescent label, and hybridization of the 

polynucleotides in the samples detected by scanning the microarrays for the presence of the 
detectable label. Methods and devices for detecting fluorescently marked targets on devices 
are known in the art. Generally, such detection devices include a microscope and light 
source for directing light at a substrate. A photon counter detects fluorescence from the 
substrate, while an x-y translation stage varies the location of the substrate. A confocal 
detection device that can be used in the subject methods is described in U.S. Patent no. 
5,631,734. A scanning laser microscope is described in Shalon et al., Genome Res. (1996) 
6:639. A scan, using the appropriate excitation line, is performed for each fluorophore used. 
The digital images generated from the scan are then combined for subsequent analysis. For 
any particular array element, the ratio of the fluorescent signal from one sample (e.g., a test 
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sample) is compared to the fluorescent signal from another sample (e.g., a reference sample), 

and the relative signal intensity determined. 

Methods for analyzing the data collected from hybridization to arrays are well known 
in the art. For example, where detection of hybridization involves a fluorescent label, data 
5 analysis can include the steps of determining fluorescent intensity as a function of substrate 
position from the data collected, removing outliers, i.e. data deviating from a predetermined 
statistical distribution, and calculating the relative binding affinity of the targets from the 
remaining data. The resulting data can be displayed as an image with the intensity in each 
region varying according to the binding affinity between targets and probes. 
10 In general, the test sample is classified as having a gene expression profile 

corresponding to that associated with a disease or non-disease state by comparing the TEP 
generated from the test sample to one or more REPs generated from reference samples (e.g., 
from samples associated with cancer or specific stages of cancer, dysplasia, samples affected 
by a disease other than cancer, normal samples, etc.). The criteria for a match or a 
15 substantial match between a TEP and a REP include expression of the same or substantially 
the same set of reference genes, as well as expression of these reference genes at 
substantially the same levels (e.g., no significant difference between the samples for a signal 
associated with a selected reference sequence after normalization of the samples, or at least 
no greater than about 25% to about 40% difference in signal strength for a given reference 
20 sequence. In general, a pattern match between a TEP and a REP includes a match in 

expression, preferably a match in qualitative or quantitative expression level, of at least one 
of, all or any subset of the differentially expressed genes of the invention. 

Pattern matching can be performed manually, or can be performed using a computer 
program. Methods for preparation of substrate matrices (e.g., arrays), design of 
25 oligonucleotides for use with such matrices, labeling of probes, hybridization conditions, 
scanning of hybridized matrices, and analysis of patterns generated, including comparison 
analysis, are described in, for example, U.S. 5,800,992. 

p TTse. of the Polynucleotides of the Inv ention in Cancer 
Oncogenesis involves the unbridled growth, dedifferentiation and abnormal 
30 migration of cells. Cancerous cells can have the ability to compress, invade, and destroy 
normal tissue. Cancerous cells may also metastasize to other parts of the body via the 
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bloodstream or the lymph system and colonize in these other areaT Different cancers are 

classified by the cell from which the cancerous cell is derived and from its cellular 
morphology and/or state of differentiation. 

Somatic genetic abnormalities cause cancer initiation and progression. Cancer 
generally is clonally formed, i.e. gain of function of oncogenes and loss of function of tumor 
suppressor genes within a single cell transform the cell to be cancerous, and that single cell 
grows and divides to form a cancerous lesion. The genes known to be involved in cancer 
initiation and progression are involved in numerous cellular functions, including 
developmental differentiation, cell cycle regulation, cell signaling, immunological response, 
DNA replication, and DNA repair. 

The identification and characterization of genetic or biochemical markers in blood or 
tissues that will detect the earliest changes along the carcinogenesis nathwav *nH n,™;,™ 
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efficacy of various therapies and preventive interventions is a major goal of cancer research. 
Scientists have identified genetic changes in stool specimens that indicate the stages of colon 
cancer, and other biomarkers such as gene mutations, hormone receptors, proteins that 
inhibit metastasis, and enzymes that metabolize drugs are all being used to determine the 
severity and predict the course of breast, prostate, lung, and other cancers. 

Recent advances in the pathogenesis of certain cancers has been helpful in 
determining patient treatment. The level of expression of certain polynucleotides can be 
indicative of a poorer prognosis, and therefore warrant more aggressive chemo- or radio- 
therapy for a patient. The correlation of novel surrogate tumor specific features with 
response to treatment and outcome in patients has defined certain prognostic indicators that 
allow the design of tailored therapy based on the molecular profile of the tumor. These 
therapies include antibody targeting and gene therapy. Moreover, a promising level of one 
or more marker polynucleotides can provide impetus for not aggressively treating a 
particular patient, thus sparing the patient the deleterious side effects of aggressive therapy. 
Determining expression of certain polynucleotides and comparison of a patients profile with 
known expression in normal tissue and variants of the disease allows a determination of the 
best possible treatment for a patient, both in terms of specificity of treatment and in terms of 
comfort level of the patient. 
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Surrogate tumor markers, such as polynucleotide expression, can also be used to 
better classify, and thus diagnose and treat, different forms and disease states of cancer. 
Two classifications widely used in oncology that can benefit from identification of the 
expression levels of the polynucleotides of the invention are staging of the cancerous 

5 disorder, and grading the nature of the cancerous tissue. 

Staging. Staging is a process used by physicians to describe how advanced the 
cancerous state is in a patient. Staging assists the physician in determining a prognosis, 
planning treatment and evaluating the results of such treatment. Different staging systems 
are used for different types of cancer, but each generally involves the following 

1 0 determinations: the type of tumor, indicated by T; whether the cancer has metastasized to 
nearby lymph nodes, indicated by N; and whether the cancer has metastasized to more 
distant parts of the body, indicated by M. This system of staging is called the TNM system. 
Generally, if a cancer is only detectable in the area of the primary lesion without having 
spread to any lymph nodes it is called Stage I. If it has spread only to the closest lymph 

1 5 nodes, it is called Stage II. In Stage III, the cancer has generally spread to the lymph nodes 
in near proximity to the site of the primary lesion. Cancers that have spread to a distant part 
of the body, such as the liver, bone, brain or another site, are called Stage IV, the most 
advanced stage. 

Currently, the determination of staging is done using pathological techniques and is 
20 based more on the presence or absence of malignant tissue rather than the characteristics of 
the tumor type. Presence or absence of malignant tissue is based primarily on the gross 
morphology of the cells in the areas biopsied. The polynucleotides of the invention can 
facilitate fine-tuning of the staging process by identifying markers for the aggresivity of a 
cancer, e.g. the metastatic potential, as well as the presence in different areas of the body. 
25 Thus, a Stage II cancer with a polynucleotide signifying a high metastatic potential cancer 
can be used to change a borderline Stage II tumor to a Stage III tumor, justifying more 
aggressive therapy. Conversely, the presence of a polynucleotide signifying a lower 
metastatic potential allows more conservative staging of a tumor. 

Grading of cancers. Grade is a term used to describe how closely a tumor resembles 
30 normal tissue of its same type. Based on the microscopic appearance of a tumor, 

pathologists will identify the grade of a tumor based on parameters such as cell morphology, 
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cellular organization, and other markers of differentiation. As a general rule, the grade of a 
tumor corresponds to its rate of growth or aggressiveness. » That is, undifferentiated or high- 
grade tumors grow more quickly than well differentiated or low-grade tumors. Information 
about tumor grade is useful in planning treatment and predicting prognosis. 

The American Joint Commission on Cancer has recommended the following 
guidelines for grading tumors: 1) GX Grade cannot be assessed; 2) Gl Well differentiated; 
G2 Moderately well differentiated; 3) G3 Poorly differentiated; 4) G4 Undifferentiated. 
Although grading is used by pathologists to describe most cancers, it plays a more important 
role in treatment planning for certain types than for others. An example is the Gleason 
system that is specific for prostate cancer, which uses grade numbers to describe the degree 
of differentiation. Lower Gleason scores indicate weli-differentiated cells. Intermediate 
scores denote tumors with moderately differentiated cells. Higher scores describe poorly 
differentiated cells. Grade is also important in some types of brain tumors and soft tissue 
sarcomas. 

The polynucleotides of the invention can be especially valuable in determining the 
grade of the tumor, as they not only can aid in determining the differentiation status of the 
cells of a tumor, they can also identify factors other than differentiation that are valuable in 
determining the aggressivity of a tumor, such as metastatic potential. 

Familial Cancer Genes A number of cancer syndromes are linked to Mendelian 
20 inheritance of a predisposition to develop particular cancers. The following table contains a 
list of cancer types that can be inherited, and for which the gene or genes responsible have 
been identified. Most of the cancer types listed can occur as part of several different genetic 
conditions, each caused by alterations in a different gene. 



Cancer Type 


Genetic Condition 


Gene 


Brain 


Li-Fraumeni syndrome 
Neurofibromatosis I 
Neurofibromatosis 2 
von Hippel-Lindau syndrome 
Tuberous sclerosis 2 


TP53 

NFl 

NF2 

VHL 

TSC2 


Breast 


Hereditary breast/ovarian cancer 1 
Hereditary breast/ovarian cancer 2 
Li-Fraumeni syndrome 
Ataxia telangiectasia 


BRCAl 
BRCA2 
TP53 
ATM 


Colon 


Familial adenomatous polyposis (FAP) 
Hereditary non-polyposis colon cancer (HNPCC) 1 
Hereditary non-polyposis colon cancer (HNPCC) 2 


APC 

HMSH2 

hMLHl 
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Cancer Type 


Genetic Condition 
Hereditary non-polyposis colon cancer (HNPCC) 3 
Hereditary non-polyposis colon cancer (HNPCC) 4 


Gene 
hPMSl 
hPMS2 


Endocrine 

(parathyroid, pituitary, GI endocrine) 


Xvfitl+tnlc* on/4nr>rinp nf»s^nlftcia 1 flMlilMl 1 

Iviuiiipie enoocnnc iicujjiaoia i yiv*i-»*^*/ 


MEN1 


Endocrine 

(pheochromacytoma, medullary thyroid) 


Multiple endocrine neoplasia 2 (MEN2) 


RET 


Endometrial 


Hereditary non-polyposis colon cancer (HNPCC) 1 
Hereditary non-polyposis colon cancer (HNPCC) 2 
Hereditary non-polyposis colon cancer (HNPCC) 3 
Hereditary non-polyposis colon cancer (HNPCC) 4 


hMSH2 
hMLHl 
hPMSl 
hPMS2 
RBI 


Eye 

Hematologic 

(lymphomas and leukemia) 


Hereditary retinoblastoma 
Li-Fraumeni syndrome 

Ataxia telangiectasia 


TP53 
ATM 


Kidney 


Hereditary Wilms 1 tumor 
von Hippel-Lindau syndrome 
Tuberous sclerosis 2 


WT1 
VHL 
TSC2 


Ovary 


Hereditary breast/ovarian cancer 1 
Hereditary breast/ovarian cancer 2 


BRCA1 
BRCA2 


Sarcoma 


Hereditary retinoblastoma 
Li-Fraumeni syndrome 
Neurofibromatosis 1 


RBI 
TP53 


Skin 


Hereditary melanoma 1 
Hereditary melanoma 2 
Basal cell naevus (Gorlin) syndrome 


CDKN2 

CDK4 

PTCH 


Stomach 


Hereditary non-polyposis colon cancer (HNPCC) 1 
Hereditary non-polyposis colon cancer (HNPCC) 2 
Hereditary non-polyposis colon cancer (HNPCC) 3 
Hereditary non-polyposis colon cancer (HNPCC) 4 


HMSH2 
hMLHl 
hPMSl 
hPMS2 
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The polynucleotides of the invention can be especially useful to monitor patients having any 
of the above syndromes to detect potentially malignant events at a molecular level before 
they are detectable at a gross morphological level. As can be seen from the table, a number 
of genes are involved in multiple forms of cancer. Thus, a polynucleotide of ihe invention 
identified as important for metastatic colon cancer can also have clinical implications for a 
patient diagnosed with stomach cancer or endometrial cancer. 

Lung Cancer. Lung cancer is one of the most common cancers in the United States, 
accounting for about 15 percent of all cancer cases, or 170,000 new cases each year. At this 
time, over half of the lung cancer cases in the United States are in men, but the number found 
in women is increasing and will soon equal that in men. Today more women die of lung 
cancer than of breast cancer. Lung cancer is especially difficult to diagnose and treat 
because of the large size of the lungs, which allows cancer to develop for years undetected. 
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In fact, lung cancer can spread outside the lungs without causing any symptoms. Adding to 
the confusion, the most common symptom of lung cancer, a persistent cough, can often be 
mistaken for a cold or bronchitis. 

Although there are more than a dozen different kinds of lung cancer, the two main 
types of lung cancer are small cell and nonsmall cell, which encompass about 90% of all 
lung cancer cases. Small cell carcinoma (also called oat cell carcinoma), which usually starts 
in one of the larger bronchial tubes, grows fairly rapidly, and is likely to be large by the time 
of diagnosis. Nonsmall cell lung cancer (NSCLC) is made up of three general subtypes of 
lung cancer. Epidermoid carcinoma (also called squamous cell carcinoma) usually starts in 
one of the larger bronchial tubes and grows relatively slowly. The size of these tumors can 
range from very small to quite large. Adenocarcinoma starts growing near the outside 

surface nf thp h lnrr onH r»ar» iron/ i*^ Ks>+Vi - L i_ n i <• 
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adenocarcinomas are described as alveolar cell cancer. Large cell carcinoma starts near the 
surface of the lung, grows rapidly, and the growth is usually fairly large when diagnosed. 
Other less common forms of hing cancer are carcinoid, cylindroma, mucoepidermoid, and 
malignant mesothelioma. 

Currently, CT scans, MRIs, X-rays, sputum cytology, and biopsies are used to 
diagnose nonsmall cell lung cancer. The form and cellular origin of the lung cancer is 
diagnosed primarily through biopsy from either a surgical biopsy or a needle aspiration of 
lung tissue, and usually the biopsy is prompted from an abnormality identified on an X-ray. 
In some cases, sputum cytology can reveal lung cancers in patients with normal X-rays or 
can determine the type of lung cancer, but because it cannot pinpoint the tumor's location, a 
positive sputum cytology test is usually followed by further tests. Since these tests are based 
in large part on gross morphology of the tissue, the diagnosis of a particular kind of tumor is 
largely subjective, and the diagnosis can vary significantly between clinicians. 

The polynucleotides of the invention can be used to distinguish types of lung cancer 
as well as identifying traits specific to a certain patient's cancer. For example, if the patient's 
biopsy expresses a polynucleotide that is associated with a low metastatic potential, it may 
justify leaving a larger portion of the patient's lung in surgery to remove the lesion. 
Alternatively, a smaller lesion with expression of a polynucleotide that is associated with 
high metastatic potential may justify a more radical removal of lung tissue and/or the 
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suixounding lymph nodes, even if no metastasis can be identified through pathological 
examination. 

Similarly, the expression of polynucleotides of the invention can be used in the 
diagnosis, prognosis and management of colorectal cancer. The differential expression of a 
5 polynucleotide in hyperplasia can be used as a diagnostic marker for metastatic lung cancer. 
The polynucleotides of the invention that would be especially useful for this purpose are 
those that exhibit differential expression between high metastatic versus low metastatic lung 
cancer , i.e. SEQ IDNOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 
361, 369, 371, 381, 395, and 400. Detection of malignant lung cancer with a higher 
1 0 metastatic potential can be determined using expression levels of any of these sequences 
alone or in combination with the levels of expression of other known genes. 

Breast Cancer. The National Cancer Institute (NCI) estimates that about 1 in 8 
women in the United States will develop breast cancer during her lifetime. Clinical breast 
examination and mammography are recommended as combined modalities for breast cancer 
1 5 screening, and the nature of the cancer will often depend upon the location of the tumor and 
the cell type from which the tumor is derived. The majority of breast cancers are 
adenocarcinomas subtypes, which can be summarized as follows: 

Ductal carcinoma in situ (DCIS): Ductal carcinoma in situ is the most common type 
of noninvasive breast cancer. In DCIS, the malignant cells have not metastasized through 
20 the walls of the ducts into the fatty tissue of the breast. Comedocarcinoma is a type of DCIS 
that is more likely than other types of DCIS to come back in the same area after 
lumpectomy. It is more closely linked to eventual development of invasive ductal carcinoma 

than other forms of DCIS. 

Infiltrating (or invasive) ductal carcinoma (IDC): this type of cancer has metastasized 
25 through the wall of the duct and invaded the fatty tissue of the breast. At this point, it has the 
potential to use the lymphatic system and bloodstream for metastasis to more distant parts of 
the body. Infiltrating ductal carcinoma accounts for about 80% of breast cancers. 

Lobular carcinoma in situ (LCIS): While not a true cancer, LCIS (also called lobular 
neoplasia) is sometimes classified as a type of noninvasive breast cancer. It does not 
30 penetrate through the wall of the lobules. Although it does not itself usually become an 
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invasive cancer, women with this condition have a higher risk of developing an invasive 
breast cancer in the same breast, or in the opposite breast. 

Infiltrating (or invasive) lobular carcinoma (ILC): ILC is similar to IDC, in that it has 
the potential metastasize elsewhere in the body. About 10% to 15% of invasive breast 
cancers are invasive lobular carcinomas. ILC can be more difficult to detect by 
mammogram than IDC. 

Inflammatory breast cancer: This rare type of invasive breast cancer accounts for 
about 1% of all breast cancers and is extremely aggressive. Multiple skin symptoms 
associated with this cancer are caused by cancer cells blocking lymph vessels or channels in 
the skin over the breast. 

Medullary carcinoma: This special type of infiltrating breast cancer has a relatively 

well defined, distinct boundary hpfwppn ti 

— ^ w^u^w aim ixuiiiiai ussuc. n accounts ior 

about 5% of breast cancers. The prognosis for this kind of breast cancer is better than for 
other types of invasive breast cancer. 

Mucinous carcinoma: This rare type of invasive breast cancer originates from mucus- 
producing cells. The prognosis for mucinous carcinoma is better than for the more common 
types of invasive breast cancer. 

Paget's disease of the nipple: This type of breast cancer starts in the ducts and spreads 
to the skin of the nipple and the areola. It is a rare type of breast cancer, occurring in only 
1% of all cases. Paget's disease can be associated with in situ carcinoma, or with infiltrating 
breast carcinoma. If no lump can be felt in the breast tissue, and the biopsy shows DCIS but 
no invasive cancer, the prognosis is excellent. 

Phyllodes tumor: This very rare type of breast tumor forms from the stroma of the 
breast, in contrast to carcinomas which develop in the ducts or lobules. Phyllodes (also 
spelled phylloides) tumors are usually benign, but are malignant on rare occasions. 
Nevertheless, malignant phyllodes tumors are very rare and less than 10 women per year in 
the US die of this disease. Benign phyllodes tumors are successfully treated by removing the 
mass and a narrow margin of normal breast tissue. 

Tubular carcinoma: Accounting for about 2% of all breast cancers, tubular 
carcinomas are a special type of infiltrating breast carcinoma. They have a better prognosis 
than usual infiltrating ductal or lobularcarcinomas. 
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High-quality mammography combined with clinical breast exam remains the only 

screening method clearly tied to reduction in breast cancer mortality. Lower dose x-rays, 
digitized computer rather than film images, and the use of computer programs to assist 
diagnosis, are almost ready for widespread dissemination. Other technologies also are being 
5 developed, including magnetic resonance imaging and ultrasound. In addition, a very low 
radiation exposure technique, positron emission tomography has the potential for detecting 
early breast cancer. 

It is also possible to differentiate between non-cancerous breast tissue and malignant 
breast tissue by analyzing differential gene expression between tissues. In addition, there 
10 may be several possible alterations that lead to the various possible types of breast cancer. 
The different types of breast tumors {e.g. , invasive vs. non-invasive, ductal vs. axillary 
lymph node) can be differentiable from one another by the identification of the differences in 
genes expressed by different types of breast tumor tissues (Porter-Jordan et al., Hematol 
Oncol Clin North Am (1994) 8:73). Breast cancer can thus be generally diagnosed by 
15 detection of expression of a gene or genes associated with breast tumors. Where enough 
information is available about the differential gene expression between various types of 
breast tumor tissues, the specific type of breast tumor can also be diagnosed. 

For example, increased estrogen receptor (ER) expression in normal breast 
epithileum, while not itself indicative of malignant tissue, is a known risk marker for 
20 development of breast cancer. Khan SA et al., Cancer Res (1994) 5*993. Malignant breast 
cancer is often divided into two groups, ER-positive and ER-negative, based on the estrogen 
receptor status of the tissue. The ER status represents different survival length and response 
to hormone therapy, and is thought to represent either: 1) an indicator of different stages of 
the disease, or 2) an indicator that allows differentiation between two similar but distinct 
25 diseases. K. Zhu et al, Med. Hypoth. (1997) 49:69. A number of other genes are known to 
vary expression between either different stages of cancer or different types of similar breast 



cancer. 



Similarly, the expression of polynucleotides of the invention can be used in the 
diagnosis and management of breast cancer. The differential expression of a polynucleotide 
30 in human breast tumor tissue can be used as a diagnostic marker for human breast cancer. 
The polynucleotides of the invention that would be especially useful for this purpose are 
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those that exhibit differential expression between breast cancer tissue with a high metastatic 
potential and a low metastatic potential, i.e. SEQ ID NOS: 9, 42, 52, 62, 65, 66, 68, 1 14, 
123, 144, 172, 178, 214, 219, 223, 258, 317, and 379. Detection of breast cancer can be 
determined using expression levels of any of these sequences alone or in combination. 
Determination of the aggressive nature and/or the metastatic potential of a breast cancer can 
also be determined by comparing levels of one or more polynucleotides of the invention and 
comparing levels of another sequence known to vary in cancerous tissue, e.g. ER expression. 
In addition, development of breast cancer can be detected by examining the ratio of SEQ ID 
NO: to the levels of steroid hormones (e.g., testosterone or estrogen) or to other hormones 
(e.g. , growth hormone, insulin). Thus expression of specific marker polynucleotides can 
be used to discriminate between normal and cancerous breast tissue, to discriminate between 
breast cancers with different cells of origin, to discriminate between breast cancers with 
different potential metastatic rates, etc. 

Diagnosis of breast cancer can also involve comparing the expression of a 
polynucleotide of the invention with the expression of other sequences in non-malignant 
breast tissue samples in comparison to one or more forms of the diseased tissue. A 
comparison of expression of one or more polynucleotides of the invention between the 
samples provides information on relative levels of these polynucleotides as well as the ratio 
of these polynucleotides to the expression of other sequences in the tissue of interest 
compared to normal. 

This risk of breast cancer is elevated significantly by the presence of an inherited risk 
for breast cancer, such as a mutation in BRCA-1 or BRCA-2. New diagnostic tools are 
being developed to address the needs of higher risk patients to complement mammography 
and physical examinations for early detection of breast cancer, particularly among younger 
women. The presence of antigen or expression markers in nipple aspirate fluid (NAF) 
samples collected from one or both breasts can be useful for useful for risk assessment or 
early cancer detection. Breast cytology and biomarkers obtained by random fine needle 
aspiration have been used to identify hyperplasia with atypia and overexpression of p53 and 
EGFR. The polynucleotides of the invention can be used in multivariate analysis with 
expression studies with genes such as p53 and EGFR as risk predictors and as surrogate 
endpoint biomarkers for breast cancer. 
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As well as being used for diagnosis and risk assessment, the expression of certain 
genes can also correlated to prognosis of a disease state. The expression of particular gene 
have been used as prognostic indicators for breast cancer including increased expression of 
c-erbB-2, pS2, ER, progesterone receptor, epidermal growth factor receptor (EGFR), neu, 

5 myc, bcl-2, int2, cytosolic tyrosine kinase, cyclin E, prad-1 , hst , uPA, P AI- 1 , PAI-2, 

cathepsin D, as well as the presence of a number of cancer-specific antigens, e.g. CEA, CA 
M26, CA M29 and CA 15.3. Davis, Br. J. BiomedSci. (1996) 55:157. Poor prognosis has 
also been linked to a decrease in expression of certain genes, such asp53, Rb, nm23. The 
expression of the polynucleotides of the invention can be of prognostic value for determining 

1 0 the metastatic potential of a malignant breast cancer, as this molecules are differentially 
expressed between high and low metastatic potential tissues tumors. The levels of these 
polynucleotides in patients with malignant breast cancer can compared to normal tissue, 
malignant tissue with a known high potential metastatic level, and malignant tissue with a 
known lower level of metastatic potential to provide a prognosis for a particular patient. 

1 5 Such a prognosis is predictive of the extent and nature of the cancer. The determined 
prognosis is useful in determining the prognosis of a patient with breast cancer, both for 
initial treatment of the disease and for longer-term monitoring of the same patient. If 
samples are taken from the same individual over a period of time, differences in 
polynucleotide expression that are specific to that patient can be identified and closely 

20 watched. 

Colon Cancer. Colorectal cancer is one of the most common neoplasms in humans 
and perhaps the most frequent form of hereditary neoplasia. Prevention and early detection 
are key factors in controlling and curing colorectal cancer. Indeed, colorectal cancer is the 
second most preventable cancer, after lung cancer. Colorectal cancer begins as polyps, 

25 which are small, benign growths of cells that form on the inner lining of the colon. Over a 
period of several years, some of these polyps accumulate additional mutations and become 
cancerous. About 20 percent of all cases of colon cancer are thought to be related to 
heredity. Currently, multiple familial colorectal cancer disorders have been identified, which 
are summarized as follows: 

30 Familial adenomatous polyposis (FAP): This condition results in a person having 

hundreds or even thousands of polyps in the colon and rectum that usually first appear during 
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the teenage years. Cancer nearly always develops in one or more of these polyps between 
the ages of 30 and 50. 

Gardner's syndrome: Like FAP, Gardner's syndrome results in polyps and colorectal 
cancers that develop at a young age. It can also cause benign tumors of the skin, soft 
connective tissue and bones. 

Hereditary nonpolyposis colon cancer (HNPCC): People with this condition tend to 
develop colorectal cancer at a young age, without first having many polyps. HNPCC has an 
autosomal dominant pattern of inheritance with variable but high penetrance estimated to be 
about 90%. HNPCC underlies 0.5%-10% of all cases of colorectal cancer. An understanding 
of the mechanisms behind the development of HNPCC is emerging, and genetic 
presymptomatic testing, now being conducted in research settings, soon will be available on 
a widespread basis for individuals identified 

" - — " wiiu uikJVUOV • 

Familial colorectal cancer in Ashkenazi Jews: Recent research has found an inherited 
tendency to developing colorectal cancer among some Jews of Eastern European descent. 
Like people with FAP, Gardner's syndrome, and HNPCC, their increased risk is due to an 
inherited mutation present in about 6% of American Jews. 

Several tests are currently used to screen for colorectal cancer, including digital rectal 
examination, fecal occult blood test, sigmoidoscopy, colonoscopy, virtual colonoscopy and 
MRI. Each of these tests identifies potential colorectal cancer lesions, or a risk of 
development of these lesions, at a fairly gross morphological level. 

The sequential alteration of a number of genes is associated with malignant 
adenocarcinoma, including the genes DCC, p53, ras, and FAP. For a review, see e.g. Fearon 
ER, et al., Cell (1990) 61(5):759; Hamilton SR et al, Cancer (1993) 72:957; Bodmer W, et 
al.,NatGenet. (1994) 4(3):2\7; Fearon ER, Ann N YAcad Sci. (1995) 768:101. Molecular 
genetic alterations are thus promising as potential diagnostic and prognostic indicators in 
colorectal carcinoma and molecular genetics of colorectal carcinoma since it is possible to 
differentiate between different types of colorectal neoplasias using molecular markers. 
Colorectal cancer can thus be generally diagnosed by detection of expression of a gene or 
genes associated with colorectal tumors. 

Similarly, the expression of polynucleotides of the invention can be used in the 
diagnosis, prognosis and management of colorectal cancer. The differential expression of a 
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polynucleotide in hyperplasia can be used as a diagnostic marker for colon cancer. The 
polynucleotides of the invention that would be especially useful for this purpose are those 
that exhibit differential expression between malignant metastatic colon cancer and normal 
patient tissue , i.e. SEQ ID NOS: 52, 1 19, 172, 288. Detection of malignant colon cancer 
5 can be determined using expression levels of any of these sequences alone or in combination 
with the levels of expression. 

Determination of the aggressive nature and/or the metastatic potential of a colon 
cancer can also be determined by comparing levels of one or more polynucleotides of the 
invention and comparing total levels of another sequence known to vary in cancerous tissue, 
10 e.g. p53 expression. In addition, development of colon cancer can be detected by examining 
the ratio of any of the polynucleotides of the invention to the levels of oncogenes {e.g. ras) 
or tumor suppressor genes {e.g. FAP or p53). Thus expression of specific marker 
polynucleotides can be used to discriminate between normal and cancerous breast tissue, to 
discriminate between breast cancers with different cells of origin, to discriminate between 
15 breast cancers with different potential metastatic rates, etc. 

G. Use of Polynucleotides to Screen for Peptide Analogs an d Antagonists 
Polypeptides encoded by the instant polynucleotides and corresponding full length 
genes can be used to screen peptide libraries to identify binding partners, such as receptors, 
from among the encoded polypeptides. 
20 A library of peptides can be synthesized following the methods disclosed in U.S. Pat. 

No. 5,010,175 ('175), and in WO 91/17823. As described below in brief, one prepares a 
mixture of peptides, which is then screened to identify the peptides exhibiting the desired 
signal transduction and receptor binding activity. In the '175 method, a suitable peptide 
synthesis support (e.g., a resin) is coupled to a mixture of appropriately protected, activated 
25 amino acids. The concentration of each amino acid in the reaction mixture is balanced or 

adjusted in inverse proportion to its coupling reaction rate so that the product is an equimolar 
mixture of amino acids coupled to the starting resin. The bound amino acids are then 
deprotected, and reacted with another balanced amino acid mixture to form an equimolar 
mixture of all possible dipeptides. This process is repeated until a mixture of peptides of the 
30 desired length {e.g., hexamers) is formed. Note that one need not include all amino acids in 
each step: one can include only one or two amino acids in some steps {e.g., where it is 
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known that a particular amino acid is essential in a given position), thus reducing the 

complexity of the mixture. After the synthesis of the peptide library is completed, the 
mixture of peptides is screened for binding to the selected polypeptide. The peptides are 
then tested for their ability to inhibit or enhance activity. Peptides exhibiting the desired 
5 activity are then isolated and sequenced. 

The method described in WO 91/17823 is similar. However, instead of reacting the 
synthesis resin with a mixture of activated amino acids, the resin is divided into twenty equal 
portions (or into a number of portions corresponding to the number of different amino acids 
to be added in that step), and each amino acid is coupled individually to its portion of resin. 
1 0 The resin portions are then combined, mixed, and again divided into a number of equal 

portions for reaction with the second amino acid. In this manner, each reaction can be easily 
driven to completion. Additionally, one can maintain separate "subpools" by treating 
portions in parallel, rather than combining all resins at each step. This simplifies the process 
of determining which peptides are responsible for any observed receptor binding or signal 
15 transduction activity. 

In such cases, the subpools containing, e.g., 1-2,000 candidates each are exposed to 
one or more polypeptides of the invention. Each subpool that produces a positive result is 
then resynthesized as a group of smaller subpools (sub-subpools) containing, e.g., 20-100 
candidates, and reassayed. Positive sub-subpools can be resynthesized as individual 
20 compounds, and assayed finally to determine the peptides that exhibit a high binding 
constant. These peptides can be tested for their ability to inhibit or enhance the native 
activity. The methods described in WO 91/7823 and U.S. Patent No. 5,194,392 (herein 
incorporated by reference) enable the preparation of such pools and subpools by automated 
techniques in parallel, such that all synthesis and resynthesis can be performed in a matter of 
25 days. 

Peptide agonists or antagonists are screened using any available method, such as 
signal transduction, antibody binding, receptor binding, mitogenic assays, chemotaxis 
assays, etc. The methods described herein are presently preferred. The assay conditions 
ideally should resemble the conditions under which the native activity is exhibited in vivo, 
30 that is, under physiologic pH, temperature, and ionic strength. Suitable agonists or 
antagonists will exhibit strong inhibition or enhancement of the native activity at 
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concentrations that do not cause toxic side effects in the subject. Agonists or antagonists that 
compete for binding to the native polypeptide can require concentrations equal to or greater 
than the native concentration, while inhibitors capable of binding irreversibly to the 
polypeptide can be added in concentrations on the order of the native concentration. 
5 The end results of such screening and experimentation will be at least one novel 

polypeptide binding partner, such as a receptor, encoded by a gene or a cDNA corresponding 
to a polynucleotide of the invention, and at least one peptide agonist or antagonist of the 
novel binding partner. Such agonists and antagonists can be used to modulate, enhance, or 
inhibit receptor function in cells to which the receptor is native, or in cells that possess the 
1 0 receptor as a result of genetic engineering. Further, if the novel receptor shares biologically 
important characteristics with a known receptor, information about agonist/antagonist 
binding can facilitate development of improved agonists/antagonists of the known receptor. 
H. Pharmaceutical Compositions and Therapeutic Uses 
Pharmaceutical compositions can comprise polypeptides, antibodies, or 
15 polynucleotides of the claimed invention. The pharmaceutical compositions will comprise a 
therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the 
claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 

20 detectable therapeutic or preventative effect. The effect can be detected by, for example, 
chemical markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature. The precise effective amount for a subject 
will depend upon the subject's size and health, the nature and extent of the condition, and the 
therapeutics or combination of therapeutics selected for administration. Thus, it is not useful 

25 to specify an exact effective amount in advance. However, the effective amount for a given 
situation is determined by routine experimentation and is within the judgment of the 
clinician. For purposes of the present invention, an effective dose will generally be from 
about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in 
the individual to which it is administered. 

30 A pharmaceutical composition can also contain a pharmaceutically acceptable carrier. 

The term "pharmaceutically acceptable carrier" refers to a carrier for administration of a 
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therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. 

The term refers to any pharmaceutical carrier that does not itself induce the production of 
antibodies harmful to the individual receiving the composition, and which can be 
administered without undue toxicity. Suitable carriers can be large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers 
are well known to those of ordinary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts 
such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of 
organic acids such as acetates, propionates, malonates, benzoates, and the like. A thorough 
discussion of pharmaceutically acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991). 

Pharmaceutically acceptable carriers in therapeutic compositions can include liquids 
such as water, saline, glycerol and ethanol. Auxiliary substances, such as wetting or 
emulsifying agents, pH buffering substances, and the like, can also be present in such 
vehicles. Typically, the therapeutic compositions are prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid 
vehicles prior to injection can also be prepared. Liposomes are included within the 
definition of a pharmaceutically acceptable carrier. 

Delivery Methods. Once formulated, the compositions of the invention can be 
(1) administered directly to the subject {e.g., as polynucleotide or polypeptides); (2) 
delivered ex vivo, to cells derived from the subject {e.g., as in ex vivo gene therapy); or (3) 
delivered in vitro for expression of recombinant proteins {e.g., polynucleotides). Direct 
delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly, or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a tumor or 
lesion. Other modes of administration include oral and pulmonary administration, 
suppositories, and transdermal applications, needles, and gene guns or hyposprays. Dosage 
treatment can be a single dose schedule or a multiple dose schedule. 

Methods for the ex vivo delivery and reimplantation of transformed cells into a 
subject are known in the art and described in e.g., International Publication No. WO 
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93/14778. Examples of cells useful in ex vivo applications include, for example, stem cells, 
particularly hematopoetic, lymph cells, macrophages, dendritic cells, or tumor cells. 
Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
accomplished by, for example, dextran-mediated transfection, calcium phosphate 
5 precipitation, polybrene mediated transfection, protoplast fusion, electroporation, 

encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA 
into nuclei, all well known in the art. 

Once a gene corresponding to a polynucleotide of the invention has been found to 
correlate with a proliferative disorder, such as neoplasia, dysplasia, and hyperplasia, the 
10 disorder can be amenable to treatment by administration of a therapeutic agent based on the 
provided polynucleotide or corresponding polypeptide. 

Preparation of antisense polynucleotides is discussed above. Neoplasias that are 
treated with the antisense composition include, but are not limited to, cervical cancers, 
melanomas, colorectal adenocarcinomas, Wilms' tumor, retinoblastoma, sarcomas, 
1 5 myosarcomas, lung carcinomas, leukemias, such as chronic myelogenous leukemia, 

promyelocytic leukemia, monocytic leukemia, and myeloid leukemia, and lymphomas, such 
as histiocytic lymphoma. Proliferative disorders that are treated with the therapeutic 
composition include disorders such as anhydric hereditary ectodermal dysplasia, congenital 
alveolar dysplasia, epithelial dysplasia of the cervix, fibrous dysplasia of bone, and 
20 mammary dysplasia. Hyperplasias, for example, endometrial, adrenal, breast, prostate, or 
thyroid hyperplasias or pseudoepitheliomatous hyperplasia of the skin, are treated with 
antisense therapeutic compositions based upon a polynucleotide of the invention. Even in 
disorders in which mutations in the corresponding gene are not implicated, downregulation 
or inhibition of expression of a gene corresponding to a polynucleotide of the invention can 
25 have therapeutic application. For example, decreasing gene expression can help to suppress 
tumors in which enhanced expression of the gene is implicated. 

Both the dose of the antisense composition and the means of administration are 
determined based on the specific qualities of the therapeutic composition, the condition, age, 
and weight of the patient, the progression of the disease, and other relevant factors. 
30 Administration of the therapeutic antisense agents of the invention includes local or systemic 
administration, including injection, oral administration, particle gun or catheterized 
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administration, and topical administration. Preferably, the therapeutic antisense composition 

contains an expression construct comprising a promoter and a polynucleotide segment of at 
least 12, 22, 25, 30, or 35 contiguous nucleotides of the antisense strand of a polynucleotide 
disclosed herein. Within the expression construct, the polynucleotide segment is located 
5 downstream from the promoter, and transcription of the polynucleotide segment initiates at 
the promoter. 

Various methods are used to administer the therapeutic composition directly to a 
specific site in the body. For example, a small metastatic lesion is located and the 
therapeutic composition injected several times in several different locations within the body 
10 of tumor. Alternatively, arteries which serve a tumor are identified, and the therapeutic 

composition injected into such an artery, in order to deliver the composition directly into the 

""•» " "ww.uui. wcjuci is aapuaicu *uiu uie composition injected directly 

into the now empty center of the tumor. The antisense composition is directly administered 
to the surface of the tumor, for example, by topical application of the composition. X-ray 
1 5 imaging is used to assist in certain of the above delivery methods. 

Receptor-mediated targeted delivery of therapeutic compositions containing an 
antisense polynucleotide, subgenomic polynucleotides, or antibodies to specific tissues is 
also used. Receptor-mediated DNA delivery techniques are described in, for example, 
Findeis et al, Trends Biotechnol. (1993) 77:202; Chiou et al, Gene Therapeutics: Methods 
20 And Applications Of Direct Gene Transfer (J.A. Wolff, ed.) (1 994); Wu et al. , J. Biol. Chem. 
(1988) 263:621; Wu et al., J. Biol. Chem. (1994) 269:542; Zenke et al., Proc. Natl. Acad. 
Sci. (USA) (1990) 57:3655; Wu et al, J. Biol. Chem. (1991) 266:338. Preferably, receptor- 
mediated targeted delivery of therapeutic compositions containing antibodies of the 
invention is used to deliver the antibodies to specific tissue. 
25 Therapeutic compositions containing antisense subgenomic polynucleotides are 

administered in a range of about 100 ng to about 200 mg of DNA for local administration in 
a gene therapy protocol. Concentration ranges of about 500 ng to about 50 mg, about 1 ug to 
about 2 mg, about 5 ug to about 500 ug, and about 20 ug to about 100 ug of DNA can also 
be used during a gene therapy protocol. Factors such as method of action and efficacy of 
transformation and expression are considerations which will affect the dosage required for 
ultimate efficacy of the antisense subgenomic polynucleotides. Where greater expression is 
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desired over a larger area of tissue, larger amounts of antisense subgenomic polynucleotides 
or the same amounts readministered in a successive protocol of administrations, or several 
administrations to different adjacent or close tissue portions of, for example, a tumor site, 
may be required to effect a positive therapeutic outcome. In all cases, routine 
5 experimentation in clinical trials will determine specific ranges for optimal therapeutic 

effect. A more complete description of gene therapy vectors, especially retroviral vectors, is 
contained in U.S. Serial No. 08/869,309, which is expressly incorporated herein, and in 
section G below. 

For polynucleotide-related genes encoding polypeptides or proteins with anti- 
10 inflammatory activity, suitable use, doses, and administration are described in U.S. Patent 
No. 5,654,173. Therapeutic agents also include antibodies to proteins and polypeptides 
encoded by the polynucleotides of the invention and related genes, as described in U.S. 
Patent No. 5,654,173. 

I. Gene Therapy 

1 5 The therapeutic polynucleotides and polypeptides of the present invention can be 

utilized in gene delivery vehicles. The gene delivery vehicle can be of viral or non-viral 
origin (see generally, Jolly, Cancer Gene Therapy (1994) 7:51; Kimura, Human Gem 
Therapy (1994) 5:845; Connelly, Human Gene Therapy (1995) 7:185; and Kaplitt, Nature 
Genetics (1994) 6:148). Gene therapy vehicles for delivery of constructs including a coding 

20 sequence of a therapeutic of the invention can be administered either locally or systemically . 
These constructs can utilize viral or non- viral vector approaches. Expression of such coding 
sequences can be induced using endogenous mammalian or heterologous promoters. 
Expression of the coding sequence can be either constitutive or regulated. 

The present invention can employ recombinant retroviruses which are constructed to 

25 carry or express a selected nucleic acid molecule of interest. Retrovirus vectors that can be 
employed include those described in EP 0 415 731; WO 90/07936; WO 94/03622; WO 
93/25698; WO 93/25234; U.S. Patent No. 5, 219,740; WO 93/1 1230; WO 93/10218; Vile 
and Hart, Cancer Res. (1993) 53:3860; Vile et al, Cancer Res. (1993) 53:962; Ram et al., 
Cancer Res. (1993) 53:83; Takamiya et al, J. Neurosci. Res. (1992) 33:493; Baba et al., J. 
30 Neurosurg. (1993) 79:729; U.S. Patent No. 4,777,127; GB Patent No. 2,200,651; and EP 0 
345 242. Preferred recombinant retroviruses include those described in WO 91/02805. 
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Packaging cell lines suitable for use with the above-described retroviral vector 

constructs can be readily prepared (see, e.g., WO 95/30763 and WO 92/05266), and used to 
create producer cell lines (also termed vector cell lines) for the production of recombinant 
vector particles. Within particularly preferred embodiments of the invention, packaging cell 
5 lines are made from human (such as HT1 080 cells) or mink parent cell lines, thereby 
allowing production of recombinant retroviruses that can survive inactivation in human 
serum. 

The present invention also employs alphavirus-based vectors that can function as 
gene delivery vehicles. Such vectors can be constructed from a wide variety of alphaviruses, 

1 0 including, for example, Sindbis virus vectors, Semliki forest virus (ATCC VR-67; ATCC 
VR-1247), Ross River virus (ATCC VR-373; ATCC VR-1246) and Venezuelan equine 

, «u ■> v ' " '-j) ^ • *~^y- ¥iv-iiju,ni^ vk. i z*ty; a i ct vk-532). 

Representative examples of such vector systems include those described in U.S. Patent Nos. 
5,091,309; 5,217,879; and 5,185,440; WO 92/10578; WO 94/21792; WO 95/27069; WO 

1 5 95/27044; and WO 95/07994. Gene delivery vehicles of the present invention can also 

employ parvovirus such as adeno-associated virus (AAV) vectors. Representative examples 
include the AAV vectors disclosed by Srivastava in WO 93/09239, Samulski et al., J. Virol. 
(1989) 65:3822; Mendelson et al., Virol. (1988) 766:154; and Flotte et al, PNAS (1993) 
P0.1O613. 

20 Representative examples of adenoviral vectors include those described by Berkner, 

Biotechniques (1988) 6:616; Rosenfeld et al, Science (1991) 252:431; WO 93/19191; Kolls 
et al. , PNAS ( 1 994) 91 :2 1 5; Kass-Eisler et al, PNAS (1 993) 90: 1 1 498; Guzman et al. , 
Circulation (1993) S&2838; Guzman et al.., Cir. Res. (1993) 75:1202; Zabner et al, Cell 
(1993) 75:207; Li et al, Hum. Gene Ther. (1993) *403; Cailaud et al, Eur. J. Neurosci. 

25 (1993) 5:1287; Vincent et al, Nat. Genet. (1993) 5:130; Jaffe et al, Nat. Genet. (1992) 

7 :372; and Levrero et al, Gene (1991) 707:195. Exemplary adenoviral gene therapy vectors 
employable in this invention also include those described in WO 94/12649, WO 93/03769; 
WO 93/19191; WO 94/28938; WO 95/1 1984 and WO 95/00655. Administration of DNA 
linked to killed adenovirus as described in Curiel, Hum. Gene Ther. (1992) 5:147 can be 

30 employed. 



80 



BNSOOCID: <WO_9933982AZJ_> 



WO 99/33982 PCT/US98/27610 
Other gene delivery vehicles and methods can be employed, including polycatiomc 
condensed DNA linked or unlinked to killed adenovirus alone, for example Curiel, Hum. 
Gene Ther. (1992) 5:147; ligand linked DNA, for example see Wu,7. Biol. Chem. (1989) 
26^:16985; eukaryotic cell delivery vehicles cells, for example see U.S. Pat. No. 5,814,482; 
5 WO 95/07994; WO 96/17072; WO 95/30763; and WO 97/42338; deposition of 

photopolymerized hydrogel materials; hand-held gene transfer particle gun, as described in 
U.S. Patent No. 5,149,655; ionizing radiation as described in U.S. Patent No. 5,206,152 and 
in W092/1 1033; nucleic charge neutralization or fusion with cell membranes. Additional 
approaches are described in Philip, Mol. Cell Biol. (1994) 7*241 1, and in Woffendin, Proc. 
1 0 Natl. Acad. Sci. (1 994) 91 : 1 58 1 . 

Naked DNA can also be employed. Exemplary naked DNA introduction methods are 
described in WO 90/1 1092 and U.S. Patent No. 5,580,859. Uptake efficiency can be 
improved using biodegradable latex beads. DNA coated latex beads are efficiently 
transported into cells after endocytosis initiation by the beads. The method can be improved 
15 further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption 
of the endosome and release of the DNA into the cytoplasm. Liposomes that can act as gene 
delivery vehicles are described in U.S. Patent No. 5,422,120; WO 95/13796; WO 94/23697; 
WO 91/14445; and EP 0524968. 

Further non-viral delivery suitable for use includes mechanical delivery systems such 
20 as the approach described in Woffendin et al., Proc. Natl. Acad. Sci. USA (1 994) 

Pi(24):l 1581. Moreover, the coding sequence and the product of expression of such can be 
delivered through deposition of photopolymerized hydrogel materials. Other conventional 
methods for gene delivery that can be used for delivery of the coding sequence include, for 
example, use of hand-held gene transfer particle gun, as described in U.S. Patent No. 
25 5,1 49,655; use of ionizing radiation for activating transferred gene, as described in U.S. 
Patent No. 5,206,152 and WO 92/1 1033. 

The present invention will now be illustrated by reference to the following examples 
which set forth particularly advantageous embodiments. However, it should be noted that 
these embodiments are illustrative and are not to be construed as restricting the invention in 
30 any way. 
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EXAMPLES 

The present invention is now illustrated by reference to the following examples 
which set forth particularly advantageous embodiments. However, these embodiments are 
illustrative and are not meant to be construed as restricting the invention in any way. 

Example 1 : Source of Biological Mater ials and Overview of Novel Polynucleotides 
Expressed b y the Biolopical Materials 
Human colon cancer cell line Kml2L4-A (Morika, W. A. K. et al., Cancer Research 
(1 988) 48.6S63) was used to construct a cDNA library from mRNA isolated from the cells. 
As described in the above overview, a total of 4,693 sequences expressed by the Kml2L4-A 
cell line were isolated and analyzed; most sequences were about 275-300 nucleotides in 
length. The KM12L4-A cell line is derived from the KM12C cell line. The KM12C cell 
line, which is pooriy metastatic (low metastatic) was established in culture from a Dukes' 
stage B 2 surgical specimen (Morikawa et al. Cancer Res. (1988) 45:6863). The KML4-A is 
15 a highly metastatic subline derived from KM12C (Yeatman et al. Nucl. Acids. Res. (1995) 
25:4007; Bao-Ling et al. Proc. Annu. Meet. Am. Assoc. Cancer. Res. (1995) 27:3269). The 
KM12C and KM12C-derived cell lines (e.g., KM12L4, KM12L4-A, etc.) are well- 
recognized in the art as a model cell line for the study of colon cancer (see, e.g., Moriakawa 
et al, supra; Radinsky al. Clin. Cancer Res. (1995) 7:19; Yeatman et al, (1995) supra; 
20 Yeatman et al. Clin. Exp. Metastasis (1996) 14:246). 

The sequences were first masked to eliminate low complexity sequences using the XBLAST 
masking program (Claverie "Effective Large-Scale Sequence Similarity Searches," In: 
Computer Methods for Macromolecu la r S eq u ence Analysis Doolittle, ed., Meth. Enzymol. 
266:212-227 Academic Press, NY, NY (1996); see particularly Claverie, in "Automated 
25 DNA Sequencing and Analysis Techniques" Adams et al, eds., Chap. 36, p. 267 Academic 
Press, San Diego, 1 994 and Claverie et al. Comput. Chem. (1993) 17:191 ). Generally, 
masking does not influence the final search results, except to eliminate of relative little 
interest due to their lox complexity, and to eliminate multiple "hits" based on similarity to 
repetitive regions common to multiple sequences, e.g., Alu repeats. Masking resulted in the 
30 elimination of 43 sequences. The remaining sequences were then used in a BLASTN vs. 
Genbank search with search parameters of greater than 70% overlap, 99% identity, and a p 
value of less than 1 x 10" 40 , which search resulted in the discarding of 1,432 sequences. 
Sequences from this search also were discarded if the inclusive parameters were met, but the 
sequence was ribosomal or vector-derived. 
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The resulting sequences from the previous search were classified into three groups 
(1, 2 and 3 below) and searched in a BLASTX vs. NRP (non-redundant proteins) database 
search: (1) unknown (no hits in the Genbank search), (2) weak similarity (greater than 45% 
identity and p value of less than 1 x KT 5 ), and (3) high similarity (greater than 60% overlap, 
5 greater than 80% identity, and p value less than 1 x 10' 5 ). This search resulted in discard of 
98 sequences as having greater than 70% overlap, greater than 99% identity, and p value of 
less than 1 x 10" 40 . 

The remaining sequences were classified as unknown (no hits), weak similarity, and 
high similarity (parameters as above). Two searches were performed on these sequences. 
1 0 First, a BLAST vs. EST database search resulted in discard of 1 77 1 sequences (sequences 
with greater than 99% overlap, greater than 99% similarity and a p value of less than 1x10" 
40 ; sequences with a p value of less than 1 x 10" 65 when compared to a database sequence of 
human origin were also excluded). Second, a BLASTN vs. Patent GeneSeq database 
resulted in discard of 15 sequences (greater than 99% identity; p value less than 1 x 10" 40 ; 
1 5 greater than 99% overlap). 

The remaining sequences were subjected to screening using other rules and 
redundancies in the dataset. Sequences with a p value of less than 1 x 10 " IM in relation to a 
database sequence of human origin were specifically excluded. The final result provided the 
404 sequences listed in the accompanying Sequence Listing. The Sequence Listing is 
20 arranged beginning with sequences with no similarity to any sequence in a database 
searched, and ending with sequences with the greatest similarity. Each identified 
polynucleotide represents sequence from at least a partial mRNA transcript. Polynucleotides 
that were determined to be novel were assigned a sequence identification number. 

The novel polynucleotides and were assigned sequence identification numbers SEQ 
25 ID NOS: 1-404. The DNA sequences corresponding to the novel polynucleotides are 
provided in the Sequence Listing. The majority of the sequences are presented in the 
Sequence Listing in the 5' to 3' direction. A small number, 25, are listed in the Sequence 
Listing in the 5' to 3' direction but the sequence as written is actually 3' to 5'. These 
sequences are readily identified with the designation "AR" in the Sequence Name in Table 1 
30 (inserted before the claims). The sequences correctly listed in the 5' to 3' direction in the 
Sequence Listing are designated "AF." The Sequence Listing filed herewith therefore 
contains 25 sequences listed in the reverse order, namely SEQ ID NOS:47, 97, 137, 171, 
173, 179, 182, 194, 200, 202, 213, 227, 258, 264, 275, 302, 313, 324, 329, 330, 331, 338, 
358, 379, and 404. 
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Because the provided polynucleotides represent partial mRNA transcripts, two or 

more polynucleotides of the invention may represent different regions of the same mRNA 

transcript and the same gene. Thus, if two or more SEQ ID NOS: are identified as belonging 

to the same clone, then either sequence can be used to obtain the full-length mRNA or gene. 

5 In order to confirm the sequences of SEQ ID NOS: 1 -404, inserts of the clones 

corresponding to these polynucleotides were re-sequenced. These "validation" sequences 

are provided in SEQ ID NOS:405-800. These validation sequences were often longer than 

the original polynucleotide sequences. They validate, and thus often provide additional 

sequence information. Validation sequences can be correlated with the original sequences 

1 0 they validate by identifying those sequences of SEQ ID NOS: 1 -404 and the validation 

sequences of SEQ ID NOS:405-800 that share the same clone name in Table 1 . 

Example 2: Results of Public Database Search to Identify Function of Gene Product*; 

SEQ ID NOS: 1-404, as well as the validation sequences SEQ ID NOS:405-800, were 
1 5 translated in all three reading frames to determine the best alignment with the individual 

sequences. These amino acid sequences and nucleotide sequences are referred, generally, as 
query sequences, which are aligned with the individual sequences. Query and individual 
sequences were aligned using the BLAST programs, available over the world wide web at 
http://ww.ncbi.nlm.nih.gov/BT.AST/ Again the sequences were masked to various extents 
20 to prevent searching of repetitive sequences or poly-A sequences, using the XBLAST 
program for masking low complexity as described above in Example 1. 

Table 2 (inserted before the claims) shows the results of the alignments. Table 2 
refers to each sequence by its SEQ ID NO:, the accession numbers and descriptions of 
nearest neighbors from the Genbank and Non-Redundant Protein searches, and the p values 
25 of the search results. Table 1 identifies each SEQ ID NO: by SEQ name, clone ID, and 

cluster. As discussed above, a single cluster includes polynucleotides representing the same 
gene or gene family, and generally represents sequences encoding the same gene product. 

For each of SEQ ID NOS: 1 -800, the best alignment to a protein or DNA sequence is 
included in Table 2. The activity of the polypeptide encoded by SEQ ID NOS: 1-800 is the 
30 same or similar to the nearest neighbor reported in Table 2. The accession number of the 
nearest neighbor is reported, providing a reference to the activities exhibited by the nearest 
neighbor. The search program and database used for the alignment also are indicated as well 
as a calculation of the p value. 
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Full length sequences or fragments of the polynucleotide sequences of the nearest 

neighbors can be used as probes and primers to identify and isolate the full length sequence 

of SEQ ID NOS: 1-800. The nearest neighbors can indicate a tissue or cell type to be used to 

construct a library for the full-length sequences of SEQ ID NOS: 1-800. 

5 SEQ ID NOS : 1 -800 and the translations thereof may be human homologs of known 

genes of other species or novel allelic variants of known human genes. In such cases, these 

new human sequences are suitable as diagnostics or therapeutics. As diagnostics, the human 

sequences SEQ ID NOS: 1-800 exhibit greater specificity in detecting and differentiating 

human cell lines and types than homologs of other species. The human polypeptides 

10 encoded by SEQ ID NOS: 1-800 are likely to be less immunogenic when administered to 

humans than homologs from other species. Further, on administration to humans, the 

polypeptides encoded by SEQ ID NOS: 1-800 can show greater specificity or can be better 

regulated by other human proteins than are homologs from other species. 



15 Example 3: Members of Protein Families 

After conducting a profile search as described in the specification above, several of the 
polynucleotides of the invention were found to encode polypeptides having characteristics of 
a polypeptide belonging to a known protein families (and thus represent new members of 
these protein families) and/or comprising a known functional domain (Table 3). Thus the 

20 invention encompasses fragments, fusions, and variants of such polynucleotides that retain 
biological activity associated with the protein family and/or functional domain identified 
herein. 

Table 3 Polynucleotides encoding gene products of a protein family or having a known 



functional domain(s). 



SEQ ID 
NO: 


Biological Activity (Profile hit) 


Start 


Stop 


Dir 


24 


4 transmembrane segments integral membrane proteins 


1218 


578 


rev 


41 


4 transmembrane segments integral membrane proteins 


1086 


413 


rev 


101 


4 transmembrane segments integral membrane proteins 


1206 


544 


rev 


157 


4 transmembrane segments integral membrane proteins 


721 


33 


rev 


341 


4 transmembrane segments integral membrane proteins 


1253 


613 


rev 


395 


4 transmembrane segments integral membrane proteins 


530 


10 


for 


395 


4 transmembrane segments integral membrane proteins 


696 


17 


for 


395 


4 transmembrane segments integral membrane proteins 


471 


39 


rev 


24 


7 transmembrane receptor (Secretin family) 


1301 


491 


rev 


41 


7 transmembrane receptor (Secretin family) 


1309 


10 


rev 


101 


7 transmembrane receptor (Secretin family) 


1330 


296 


rev 


157 


7 transmembrane receptor (Secretin family) 


1173 


249 


rev 


291 


7 transmembrane receptor (Secretin family) 


1400 


269 


rev 
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Table 3 Polynucleotides encoding gene products of a protein family or having a known 
functional domain(s). 



SEQ ID 
NO: 


uiuiugiLai /wuviiy yr roiuc mil 


Start 


.Stop 


Dir 


291 


7 transmembrane receptor (Secretin family) 


719 


1 10 

1-31/ 


ror 


305 


7 transmembrane receptor (Secretin family) 


096 


4 


fnr 

ror 


305 


7 transmembrane receptor (Secretin family) 


751 




rev 


315 


7 transmembrane receptor (Secretin family) 


1058 


270 




341 


7 transmembrane receptor (Secretin family) 


1265 


534 


rev 


116 


Ank repeat 


141 


9 1 R 


ror 


251 


Ank repeat 


900 


907 


fnr 

ror 


251 


Ank repeat 


467 
to / 


JO/ 


for 

ror 


63 


Al Pases Assnpiatpfl with \Zarir«ic f'Alliilar A Mn/itiAr 
A L aawa naautiaicu Willi V diiUlia V^CUUlai /\CllVllieS 


jhd 


60 


tor 


116 


ATPaseS Associated with Vnrinnc f^llnlar Ar»ti\/ittc»r 




oil 
313 


tor 


134 


ATPases Associated with Various fVl hilar Artiviti^c 


^9^ 


J 1 


rev 


136 


ATPases Associated with Various Cellular Activities 


719 

r iZi 


161 


fnr 

ror 


151 


ATPases Associated with Various Cellular Activities 


719 


73 


for 


151 


ATPases Associated with Various Cellular Activities 


386 




fnr 


384 


ATPases Associated with Various Cellular Activities 


664 


140 


for 


404 


ATPases Associated with Various Cellular Activities 


704 


52 


for 


374 


Basic region plus leucine zipper transcription factors 


298 


146 


fnr 

lur 


97 


Bromodomain (conserved sequence found in human, 
Drosophila and yeast proteins.) 


230 


61 


fnr 
lur 


136 


EF-hand 


121 


207 


for 


242 


EF-hand 


238 


155 


for 


379 


EF-hand 


212 


126 


for 


308 


Eukaryotic aspartyl proteases 


1300 


461 


rev 


213 


GATA family of transcription factors 


720 


377 


for 


367 


G-protein alpha subunit 


071 


467 

HO / 


rev 


188 


Phorbol esters/diacvlelvcerol hinHina 


Q 1 


1 *7*7 
ill 


tor 


251 


Phorbol esters/diacvlelvcerol hind in a 




7 1 Q 


tor 


202 


nrotein kinase 


A CO 


1 


rev 


202 


protein kinase 


y /u 


i 
1 


rev 


315 


protein kinase 


710 


1 ^£ 


fnr 

ror 


315 


protein kinase 


1023 


197 


for 


367 


protein kinase 


1046 


285 


rev 


397 


protein kinase 


511 


6 


for 


256 


Protein phosphatase 2C 


13 


90 


for 


256 


Protein phosphatase 2C 


163 


86 


for 


382 


Protein Tyrosine Phosphatase 


261 


2 


for 


306 


SH3 Domain 


141 


296 


for 


386 


SH3 Domain 


359 : 


209 i 


for 


169 


Trypsin 


764 


164 i 




188 


WD domain, G-beta repeats * 


180 


*89 


or 


188 


WD domain, G-beta repeats ; 


?06 

£UO 


1 1 7 


Pnr 

ror 


335 


WD domain, G-beta repeats 




59 


or 


23 


ivnt family of developmental signaling proteins 


1151 ; 


535 i 


cv 


291 i 


tvnt family of developmental signaling proteins 


779 1 


i9 i 


•ev 


291 i 


writ family of developmental signaling proteins ] 


1347 : 


582 r 


ev 


324 ^ 


ivnt family of developmental signaling proteins ] 


1180 i 


199 r 


ev 


330 i 


wit family of developmental signaling proteins ] 


180 t 


199 r 


ev 


341 \ 


rat family of developmental signaling proteins ] 


399 f 


>60 r 


ev 
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Table 3 Polynucleotides encoding gene products of a protein family or having a known 



functional domain(s). 



SEQ ID 
NO: 


Biological Activity (Profile hit) 

. . 


atari 






353 


wnt family of developmental signaling proteins 


880 
oo\j 


4Q 




188 


WW/rsp5/WWP domain containing proteins 






for 


379 


W W/rspj/ w w r aomam containing proieiii^ 


12 


89 


for 




W W/rSpj/ W W x UlJIIlalll CUIIlallllllg piUlwllia 


153 


76 


for 


395 


WW/rsp5/WWP domain containing proteins 


156 


64 


for 


61 


Zinc finger, C2H2 type 


254 


192 


for 


306 


Zinc fineer % C2H2 type 


428 


367 


for 1 


386 


Zinc finger, C2H2 type 


191 


253 


for 


322 


Zinc finger, CCHC class 


553 


503 


for 


306 


Zinc-binding metalioprotease domain 


101 


60 


rev 


395 


Zinc-binding metalioprotease domain 


28 


69 


rev 



Start and stop indicate the position within the individual sequenes that align with the 
query sequence having the indicated SEQ ID NO. The direction (Dir) indicates the 
orientation of the query sequence with respect to the individual sequence, where forward 
5 (for) indicates that the alignment is in the same direction (left to right) as the sequence 
provided in the Sequence Listing and reverse (rev) indicates that the alignment is with a 
sequence complementary to the sequence provided in the Sequence Listing. 

Some polynucleotides exhibited multiple profile hits because, for example, the particular 
10 sequence contains overlapping profile regions, and/or the sequence contains two different 
functional domains. These profile hits are described in more detail below. 

a^ Four Transmembrane Integral Membrane Proteins. SEQ ID NOS: 24, 41, 101, 
157, 341, and 395 correspond to a sequence encoding a polypeptide that is a member of the 4 
transmembrane segments integral membrane protein family (transmembrane 4 family). The 
1 5 transmembrane 4 family of proteins includes a number of evolutionarily-related eukaryotic 
cell surface antigens (Levy et al,J. Biol Chem., (1991) 266:14597; Tomlinson et ai, Eur. J. 
Immunol. (1993) 25:136; Barclay etal The leucocyte antigen factbooks. (1993) Academic 
Press, London/San Diego). The proteins belonging to this family include: 1) Mammalian 
antigen CD9 (MIC3), which is involved in platelet activation and aggregation; 2) 
20 Mammalian leukocyte antigen CD37, expressed on B lymphocytes; 3) Mammalian 

leukocyte antigen CD53 (OX-44), which is implicated in growth regulation in hematopoietic 
cells; 4) Mammalian lysosomal membrane protein CD63 (melanoma-associated antigen 
ME491; antigen AD1); 5) Mammalian antigen CD81 (cell surface protein TAPA-1), which 
is implicated in regulation of lymphoma cell growth; 6) Mammalian antigen CD82 (protein 
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R2; antigen C33; Kangai 1 (KAI1)), which associates with CD4 or CD8 and delivers 

costimulatory signals for the TCR/CD3 pathway; 7) Mammalian antigen CD151 (SFA-1; 
platelet-endothelial tetraspan antigen 3 (PETA-3)); 8) Mammalian cell surface glycoprotein 
A15 (TALLA-1; MXS1); 9) Mammalian novel antigen 2 (NAG-2); 10) Human tumor- 
associated antigen CO-029; 1 1) Schistosoma mansoni and japonicum 23 Kd surface antigen 
(SM23 / SJ23). 

The members of the 4 transmembrane family share several characteristics. First, they 
all are apparently type III membrane proteins, which are integral membrane proteins 
containing an N-terminal membrane-anchoring domain which is not cleaved during 
biosynthesis and which functions both as a translocation signal and as a membrane anchor. 
The family members also contain three additional transmembrane regions, at least seven 
conserved cysteines residues, and are of approximately the same size (218 to 284 residues). 
These proteins are collectively know as the "transmembrane 4 superfamily" (TM4) because 
they span plasma membrane four times. A schematic diagram of the domain structure of 
these proteins is as follows: 

+-+ + + — + + + + + + 

1 1 TMa | Extra | TM2| Cyt | TM3 | Extracellular | TM4 | Cyt| 

+-+ + + — c C + CC— C C— + C — + 

********* 

where Cyt is the cytoplasmic domain, TMa is the transmembrane anchor; TM2 to TM4 
represents transmembrane regions 2 to 4, *C are conserved cysteines, and '* 'indicates the 
position of the consensus pattern. The consensus pattern spans a conserved region including 
two cysteines located in a short cytoplasmic loop between two transmembrane domains: 
Consensus pattern: G-x(3)-[LIVMF]-x(2)-[GSA]-[LIVMF](2)-G-C-x-[GA]-[STA]- x(2)- 
[EG]-x(2)-[CWN]-[LIVM](2). 

b) Seven Transmembrane Integral Membrane Proteins. SEQ ID NOS: 24,41, 101, 
1 57, 291 , 305, 3 1 5, and 341 correspond to a sequence encoding a polypeptide that is a 
member of the seven transmembrane receptor family. G-protein coupled receptors 
(Strosberg, Eur, J. Biochem: (1991) 196:1; Keriavage, Curr. Opin. Struct Biol (1991) 
7:394; and Probst et ai, DNA Cell Biol. (1992) 77:1; and Savarese etal, Biochem. J. (1992) 
293:1) (also called R7G) are an extensive group of hormones, neurotransmitters, odorants 
and light receptors which transduce extracellular signals by interaction with guanine 
nucleotide-binding (G) proteins. The tertiary structure of these receptors is thought to be 

highly similar. They have seven hydrophobic regions, each of which most probably spans 
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the membrane. The N-terminus is located on the extracellular side of the membrane and is 
often glycosylated, while the C-terminus is cytoplasmic and generally phosphorylated. 
Three extracellular loops alternate with three intracellular loops to link the seven 
transmembrane regions. Most, but not all of these receptors, lack a signal peptide. The most 
5 conserved parts of these proteins are the transmembrane regions and the first two 

cytoplasmic loops. A conserved acidic-Arg-aromatic triplet is present in the N-terminal 
extremity of the second cytoplasmic loop (Attwood et al, Gene (1991) 98:153) and could 
be implicated in the interaction with G proteins. 

To detect this widespread family of proteins a pattern is used that contains the 
10 conserved triplet and that also spans the major part of the third transmembrane helix. 

Additional information about the seven transmembrane receptor family, and methods for 
their identification and use, is found in U.S. Patent No. 5,759,804. Due in part to their 
expression on the cell surface and other attractive characteristics, seven transmembrane 
protein family members are of particular interest as drug targets, as surface antigen markers, 
15 and as drug delivery targets {e.g. , using antibody-drug complexes and/or use of anti-seven 
transmembrane protein antibodies as therapeutics in their own right). 

c) Ank Repeats. SEQ ID NOS: 1 16 and 251 represent polynucleotides encoding Ank 
repeat-containing proteins. The ankyrin motif is a 33 amino acid sequence named after the 
protein ankyrin which has 24 tandem 33-amino-acid motifs. Ank repeats were originally 
20 identified in the cell-cycle-control protein cdclO (Breeden et al, Nature (1987) 529:651). 

Proteins containing ankyrin repeats include ankyrin, myotropin, I-kappaB proteins, cell cycle 
protein cdclO, the Notch receptor (Matsuno et al., Development (1997) 124(21) -4265); G9a 
(or BAT8) of the class III region of the major histocompatibility complex (Biochem J. 
290:811-818, 1993), FABP, GABP, 53BP2, Linl2, glp-1, SW14, and SW16. The functions 
25 of the ankyrin repeats are compatible with a role in protein-protein interactions (Bork, 

Proteins (1993) 17(4):363; Lambert and Bennet, Eur. J. Biochem. (1993) 211:1; Kerr et al, 
Current Op. Cell Biol. (1992) 4:496; Bennet et al, J. Biol. Chem. (1980) 255:6424). 

The 90 kD N-terminal domain of ankyrin contains a series of 24 33-amino-acid ank 
repeats. (Lux et al, Nature (1990) 5^:36-42, Lambert et al, PNAS USA (1990) 57:1730.) 
30 The 24 ank repeats form four folded subdomains of 6 repeats each. These four repeat 
subdomains mediate interactions with at least 7 different families of membrane proteins. 
Ankyrin contains two separate binding sites for anion exchanger dimers. One site utilizes 
repeat subdomain two (repeats 7-12) and the other requires both repeat subdomains 3 and 4 

(repeats 13-24). Since the anion exchangers exist in dimers, ankyrin binds 4 anion 
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exchangers at the same time. (Michaely and Bennett,./ Biol. Chem. (1995) 270fJ7;.-22O5O) 
The repeat motifs are involved in ankyrin interaction with tubulin, spectrin, and other 
membrane proteins. (Lux et al., Nature (1990) 344:36.) 

The Rel/NF-kappaB/Dorsal family of transcription factors have activity that is 
controlled by sequestration in the cytoplasm in association with inhibitory proteins referred 
to as I-kappaB. (Gilmore, Cell (1 990) 52:841 ; Nolan and Baltimore, Curr Opin Genet Dev. 
(1992) 2:21 1; Baeuerle, Biochim BiophysActa (1991) 7072:63; Schmitz et al., Trends Cell 
Biol. (1991) 7:130.) I-kappaB proteins contain 5 to 8 copies of 33 amino acid ankyrin 
repeats and certain NF-kappaB/rel proteins are also regulated by cis-acting ankyrin repeat 
containing domains including P 105NF-kappaB which contains a series of ankyrin repeats 
(Diehl and Hannink, J. Virol. (1993) <57f72;:7161). The I-kappaBs and Cactus (also 
containing ankyrin repeats) inhibit activators through differential interactions with the Rel- 
homology domain. The gene family includes proto-oncogenes, thus broadly implicating I- 
kappaB in the control of both normal gene expression and the aberrant gene expression that 
makes cells cancerous. (Nolan and Baltimore, Curr Opin Genet Dev. (1992) 2(2):2\ 1-220). 
In the case of rel/NF-kappaB and pp40/I-kappaBp, both the ankyrin repeats and the carboxy- 
terminal domain are required for inhibiting DNA-binding activity and direct association of 
pp40/I-kappaBp with rel/NF-kappaB protein. The ankyrin repeats and the carboxy-terminal 
of pp40/I-kappaBp ( form a structure that associates with the rel homology domain to inhibit 
DNA binding activity (Inoue et al., PNAS USA (1992) SP.-4333). 

The 4 ankyrin repeats in the amino terminus of the transcription factor subunit 
GABPp are required for its interaction with the GABPct subunit to form a functional high 
affinity DNA-binding protein. These repeats can be crosslinked to DNA when GABP is 
bound to its target sequence. (Thompson et al., Science (1991) 255:762; LaMarco et al., 
Science (1991) 255:789). 

Myotrophin, a 12.5 kDa protein having a key role in the initiation of cardiac 
hypertrophy, comprises ankyrin repeats. The ankyrin repeats are characteristic of a hairpin- 
like protruding tip followed by a helix-turn-helix motif. The V-shaped helix-turn-helix of 
the repeats stack sequentially in bundles and are stabilized by compact hydrophobic cores, 
whereas the protruding tips are less ordered. 

d ) ATPases Associated with Various Cellular Activities fAAA^ SEQ ID NOS: 63, 
1 16, 134, 136, 151, 384, and 404 polynucleotides encoding novel members of the "ATPases 
Associated with diverse cellular Activities" (AAA) protein family The AAA protein family 
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is cZoseTof a large number of ATPases that share a conserved region of about 220 amino 
acids that contains an ATP-binding site (Froehlich et al, J. Cell Biol. (1991) 774:443; 
Erdmann etal. Cell (1991) 6*499; Peters et al, EMBOJ. (1990) 9:1757; Kunau era/., 
Biochimie (1993) 75:209-224; Confalonieri et al, BioEssays (1995) 77:639; 
htt p:// y eamob.pci.chem^^ The proteins that 

belong to this family either contain one or two AAA domains. 

Proteins containing two AAA domains include: 1) Mammalian and drosophila NSF 
(N-ethylmaleimide-sensitive fusion protein) and the fungal homolog, SEC 18, which are 
involved in intracellular transport between the endoplasmic reticulum and Golgi, as well as 
between different Golgi cisternae; 2) Mammalian transitional endoplasmic reticulum 
ATPase (previously known as P 97 or VCP), which is involved in the transfer of membranes 
from the endoplasmic reticulum to the golgi apparatus. This ATPase forms a ring-shaped 
homooligomer composed of six subunits. The yeast homolog, CDC48, plays a role m 
spindle pole proliferation; 3) Yeast protein PAS1 essential for peroxisome assembly and the 
related protein PAS1 from Pichia pastoris; 4) Yeast protein AFG2; 5) Sulfolobus 
acidocaldarius protein SAV and Halobacterium salinarium cdcH, which may be part of a 
transduction pathway connecting light to cell division. 

Proteins containing a single AAA domain include: 1) Escherichia coli and other 
bacteria ftsH (or hflB) protein. FtsH is an ATP-dependent zinc metallopeptidase that 
degrades the heat-shock sigma-32 factor, and is an integral membrane protein with a large 
cytoplasmic C-terminal domain that contain both the AAA and the protease domams; 2) 
Yeast protein YME1, a protein important for maintaining the integrity of the mitochondnal 
compartment. YME1 is also a zinc-dependent protease; 3) Yeast protein AFG3 (or YTA10). 
This protein also contains an AAA domain followed by a zinc-dependent protease domain; 
4) Subunits from regulatory complex of the 26S proteasome (Hilt et al, Trends Biochem. 
Sci (1996) 27 :96), which is involved in the ATP-dependent degradation of ubiquitinated 
proteins, which subunits include: a) Mammalian 4 and homologs in other higher eukaryotes, 
in yeast (gene YTA5) and fission yeast (gene mts2); b) Mammalian 6 (TBP7) and homologs 
in other higher eukaryotes and in yeast (gene YTA2); c) Mammalian subunit 7 (MSS1) and 
homologs in other higher eukaryotes and in yeast (gene CIM5 or YTA3); d) Mammalian 
subunit 8 (P45) and homologs in other higher eukaryotes and in yeast (SUG1 or CIM3 or 
TBY1) and fission yeast (gene letl); e) Other probable subunits include human TBP1, which 
influences HIV gene expression by interacting with the virus tat transactivator protein, and 
yeast YTA1 and YTA6; 5) Yeast protein BCS1, a mitochondrial protein essential for the 
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expression of the R*e iron-sulfur protein; 6) Yeast proteinlsPl, a proteinTo" 
mitochondrial sorting of proteins; 7) Yeast protein PAS 8, and the corresponding 
protems PASS from Pichia pastoris and PA Y4 from Yarrowia lipolytica; 8) Mouse protein 
SKD1 and rts fission yeast homolog (SpAC2Gl 1.06); 9) Caenorhabditis elegans meiotic 
spmdleformationproteinmei-1; 10) Yeast protein SAP1 ' 1 1) Yeast protein YTA7- and 12) 
Mycobacterium leprae hypothetical protein A2 126A. 

In general, the AAA domains in these proteins act as ATP-dependent protein 
clamps(Confalonieri e ^/. (1995) BioEssays J 7:639). In addition to the ATP-binding 'A' and 
m ° tlfS ' Whkh m l0Cated in * e N-terminal half of this domain, there is a highly 
10 conserved region located in the central part of the domain which was used in the 

development of the signature pattern. The consensus pattern is: [LIVMT]-x-fLIVMTl- 
[LIVMF]-x-[GATMC]-[STj-[NS]-x(4)-[LIVM]-D-x-A-[LIFA]-x-R. 

e) Basic Region Pius Leucine Zipper Transep t,™ v^. SEQIDN0374 
correspond to a polynucleotide encoding a novel member of the family of basic region plus 
15 leucme zipper transcription factors. The bZIP superfamily (Hurst, ProteinProf. (1995) 
2:105; and Ellenberger, Cur, Opin. Struct. Biol. (1994) 4:12) of eukaryotic DNA-binding 
transection factors encompasses proteins that contain a basic region mediating sequence- 
specific DNA-binding followed by a leucine zipper required for dimerization. Members of 
the family include transcription factor AP-1, which binds selectively to enhancer elements in 
■0 the czs control regions of SV40 and metallothionein IIA. AP-1, al so known as c-jun is the 
cellular homolog of the avian sarcoma virus 1 7 (ASV1 7) oncogene v-jun. 

Other members of this protein family include jun-B and jun-D, probable 
transcription factors that are highly similar to jun/AP- 1 ; the fos protein, a proto-oncogene 
that forms a non-covalent dimer with c-jun; the fos-related proteins fra-1, and fos B; and 
5 mammalian cAMP response element (CRE) binding proteins CREB, CREM ATF-1 ATF 
3, ATF-4, ATF-5, ATF-6 and LRF-1. The consensus pattern for this protein family is - 
[KR]-x(l,3)-[RKSAQ]-N- X (2)-[SAQ](2)-x-[RKTAENQ]-x-R-x-[RK]. 

f) Bromodomain, SEQ ID NO:97 corresponds to a polynucleotide encoding a 
polypeptide having a bromodomain region (Haynes et al., 1992, Nucleic Acids Res 
» 20:2693-2603, Tamkun et al., 1992, Cell 68:561-572, and Tamkun, 1995, Curr. Opi, Genet 
Dev. 5:473-477), which is a conserved region of about 70 amino acids found in the 
followmg proteins: 1) Higher eukaryotes transcription initiation factor TFIID 250 Kd 
subumt (TBP-associated factor P 250) (gene CCG1); P250 is associated with the TFIID 
TATA-box binding protein and seems essential for progression of the Gl phase of the cell 
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cy !le ^Human RINGS, a protein of unknown function encoded in the MHC class II locus; 
3) Mammalian CREB-binding protein (CBP), which mediates cAMP-gene regulauon by 
binding specifically to ohosphorylated CREB protein; 4) Mammalian homologs of brahma, 
including three brahma-like human: SNF2a(hBRM), SNF2b, and BRG1; 5) Human BS69, 
a protein that binds to adenovirus El A and inhibits El A transaction; 6) Human peregrm 
(or Br 140). 

The bromodomain is thought to be involved in protein-protein interactions and may 
be important for the assembly or activity of multicomponent complexes involved in 
transcriptional activation. The consensus pattern, which spans a major part of the 
bromodomain, is: [S TANVFl-x(2)^ 

[ LIVMFY]-x(3)-[LIVM]-x(4)-[LIVM]-x(6,8)-Y-x(12,13)-[LIVM]-x(2).N-[SACF]-x(2)- 
[FY]. 

g) F.F-Hand. SEQ ID NOS:136, 242, and 379 correspond to polynucleotides 
encoding a novel protein in the family of EF-hand proteins. Many calcium-binding proteins 
belong to the same evolutionary family and share a type of calcium-binding domain known 
as the EF-hand (Kawasaki et al, Protein. Prof. (1995) 2:305-490). This type of domain 
consists of a twelve residue loop flanked on both sides by a twelve residue alpha-hehcal 
domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal 
configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; 
these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 
12 provides two oxygens for liganding Ca (bidentate ligand). 

Proteins known to contain EF-hand regions include: Calmodulin (Ca=4, except m 
yeast where Ca=3) ("Ca=" indicates approximate number of EF-hand regions); 
diacylglycerol kinase (EC 2.7.1.107) (DGK) (Ca=2); 2) FAD-dependent glycero- 
phosphate dehydrogenase (EC 1 .1.99.5) from mammals (Ca=l); guanylate cyclase activating 
protein (GCAP) (Ca=3); MIF related proteins 8 (MRP-8 or CFAG) and 14 (MRP-14) 
(Ca=2); myosin regulatory light chains (Ca=l); oncomodulin (Ca=2); osteonectin (basement 
membrane protein BM-40) (SPARC); and proteins that contain an "osteonectin" domain 

(QR1, matrix glycoprotein SCI). 

The consensus pattern includes the complete EF-hand loop as well as the first residue 
which follows the loop and which seem to always be hydrophobic. 

Consensus pattern: D-x-[DNS]-{ILVFYW}-[DENSTG]-[DNQGHRK]-{GP}- 
[L1VMC]-PENQSTAGC]-X(2)-PE]-[LIVMFYW] 
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h) E^micA^^o^ SEQ ID NO:308™esponds to a 
novel eukaryotic aspartyl protease. Aspartyl proteases, known as acid proteases, (EC 
3.4.23.-) are a widely distributed family of proteolytic enzymes (Foltmann B. Essays 
Biochem. (1981) 77:52; Davies D.R., Annu. Rev. Biophys. Chem. (1990) 7P.189; Rao 
J.K.M., et al, Biochemistry (1991) 30:4663) known to exist in vertebrates, fungi plants 
retroviruses and some plant viruses. Aspartate proteases of eukaryotes are monomeric 
enzymes which consist of two domains. Each domain contains an active site centered on a 
catalyuc aspartyl residue. The two domains most probably evolved from the duplication of 
an ancestral gene encoding a primordial domain. Currently known eukaryotic aspartyl 
proteases include: 1) Vertebrate gastric pepsins A and C (also known as gastricsin); 2) 
Vertebrate chymosin (rennin), involved in digestion and used for making cheese- 3) 
Vertebrate lysosomal cathepsins D (EC 3.4.23.5) and E (EC 3.4.23.34); 4) Mammalian renin 
(EC 3.4.23.15) whose function is to generate angiotensin I from angiotensinogen in the 
Plasma; 5) Fungal proteases such as aspergillopepsin A (EC 3.4.23.18), candidapepsin (EC 
3.4.23.24), mucoropepsin (EC 3.4.23.23) (mucor rennin), endothiapepsin (EC 3 4 23 22) 
polyporopepsin (EC 3.4.23.29), and rhizopuspepsin (EC 3.4.23.21); and 6) Yeast 
saccharopepsin (EC 3.4.23.25) (proteinase A) (gene PEP4). PEP4 is implicated in 
posttranslational regulation of vacuolar hydrolases; 7) Yeast barrierpepsin (EC 3.4.23 35) 
(gene BAR1); a protease that cleaves alpha-factor and thus acts as an antagonist of the 
mating pheromone; and 8) Fission yeast sxal which is involved in degrading or processing 
the mating pheromones. 

Most retroviruses and some plant viruses, such as badnaviruses, encode for an 
aspartyl protease which is an homodimer of a chain of about 95 to 125 amino acids. In most 
retrovxruses, the protease is encoded as a segment of a polyprotein which is cleaved during 
the maturation process of the virus. It is generally part of the pol polyprotein and, more 
rarely, of the gag polyprotein. Because the sequence around the two aspartates of eukaryotic 
aspartyl proteases and around the single active site of the viral proteases is conserved a 
smgle signature pattern can be used to identify members of both groups of proteases '^e 
consensus pattern is: [LIVMFGAC]-[LIVMTADN]-[LIVFSA]-D-[ST]-G-[STAV] 
[STAPDENQ]- x-[LIVMFSTNC]-x-[LIVMFGTA], where D is the active site residue. 

0 GATA Family of Transcrip tion Factors SEQ ID NO :2 13 corresponds to a novel 
member of the GATA family of transcription factors. The GATA family of transcription 
factors are proteins that bind to DNA sites with the consensus sequence (A/T)GATA( A/G) 
found within the regulatory region of a number of genes. Proteins currently known to belong 
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to teSl: 1) GATA-1 (Trainor, CD., et al, Nature (1990) 343:92) (also known as 
Eryfl, GF-1 or NF-E1), which binds to the GATA region of globin genes and other genes 
expressed in erythroid cells. It is a transcriptional activator which probably serves as a 
general •switch 1 factor for erythroid development; 2) GATA-2 (Lee, M.E., et al, J. Biol. 
5 Chem. (1991) 2(56:16188), a transcriptional activator which regulates endothelin-1 gene 
expression in endothelial cells; 3) GATA-3 (Ho, I.-C, et al, EMBOJ. (1991) 70:1 187), a 
transcriptional activator which binds to the enhancer of the T-cell receptor alpha and delta 
genes; 4) GATA-4 (Spieth, J., et al, Mol Cell Biol (1991) 77:4651), a transcriptional 
activator expressed in endodermally derived tissues and heart; 5) Drosophila protein pannier 
1 0 (or DGATAa) (gene pnr) which acts as a repressor of the achaete-scute complex (as-c); 6) 
Bombyx mori BCFI (Drevet, J.R., et al,J. Biol Chem. (1994) 269:10660), which regulates 
the expression of chorion genes; 7) Caenorhabditis elegans elt-1 and elt-2, transcriptional 
activators of genes containing the GATA region, including vitellogenin genes (Hawkins, 
M.G., et al.,J. Biol Chem. (1995) 270:14666); 8) Ustilago maydis urbsl (Voisard, C.P.O., 
15 et al, Mol. Cell Biol. (1993) 75:7091), a protein involved in the repression of the 
biosynthesis of siderophores; 9) Fission yeast protein GAF2. 

All these transcription factors contain a pair of highly similar 'zinc finger 1 type 
domains with the consensus sequence C-x2-C-xl7-C-x2-C. Some other proteins contain a 
single zinc finger motif highly related to those of the GATA transcription factors. These 
20 proteins are: 1) Drosophila box A-binding factor (ABF) (also known as protein serpent 

(gene srp)) which may function as a transcriptional activator protein and may play a key role 
in the organogenesis of the fat body; 2) Emericella nidulans are (Arst, H.N., Jr., et al, 
Trends Genet. (1989) 5:291) a transcriptional activator which mediates nitrogen metabolite 
repression; 3) Neurospora crassa nit-2 (Fu, Y.-H., et al, Mol. Cell. Biol (1990) 70:1056), a 
25 transcriptional activator which turns on the expression of genes coding for enzymes required 
for the use of a variety of secondary nitrogen sources, during conditions of nitrogen 
limitation; 4) Neurospora crassa white collar proteins 1 and 2 (WC-1 and WC-2), which 
control expression of light-regulated genes; 5) Saccharomyces cerevisiae DAL81 (or 
UGA43), a negative nitrogen regulatory protein; 6) Saccharomyces cerevisiae GLN3, a 
30 positive nitrogen regulatory protein; 7) Saccharomyces cerevisiae GAT1 ; 8) Saccharomyces 
cerevisiae GZF3. 

The consensus pattern for the GATA family is: C-x-[DN]-C-x(4,5)-[ST]-x(2)-W- 
[HR]-[RK]-x(3)-[GN]-x(3,4)-C-N-[AS]-C, where the four Cs are zinc ligands. 
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j) G-Protein Alpha Snhunit SEQ ID NO:367 corre^onds to a gene encoding a novel 
polypeptide of the G-protein alpha subunit family. Guanine nucleotide binding proteins (G- 
proteins) are a family of membrane-associated proteins that couple extracellularly-activated 
integral-membrane receptors to intracellular effectors, such as ion channels and enzymes that 
vary the concentration of second messenger molecules. G-proteins are composed of 3 
subunits (alpha, beta and gamma) which, in the resting state, associate as a trimer at the inner 
face of the plasma membrane. The alpha subunit has a molecule of guanosine diphosphate 
(GDP) bound to it. Stimulation of the G-protein by an activated receptor leads to its 
exchange for GTP (guanosine triphosphate). This results in the separation of the alpha from 
the beta and gamma subunits, which always remain tightly associated as a dimer. Both the 
alpha and beta-gamma subunits are then able to interact with effectors, either individually or 
in a cooperative manner. The intrinsic GTPase activity of the alpha subunit hydropses the 
bound GTP to GDP. This returns the alpha subunit to its inactive conformation and allows it 
to reassociate with the beta-gamma subunit, thus restoring the system to its resting state. 

G-protein alpha subunits are 350-400 amino acids in length and have molecular 
weights in the range 40-45 kDa. Seventeen distinct types of alpha subunit have been 
identified in mammals. These fall into 4 main groups on the basis of both sequence 
similarity and function: alpha-s, alpha-q, alpha-i and alpha-I2 (Simon et al, Science (1993) 
252:802). Many alpha subunits are substrates for ADP-ribosylation by cholera or pertussis 
toxins. They are often N-terminally acylated, usually with myristate and/or palmitoylate, 
and these fatty acid modifications are probably important for membrane association and ' 
high- affinity interactions with other proteins. The atomic structure of the alpha subunit of 
the G-protein involved in mammalian vision, transducin, has been elucidated in both GTP- 
and GDB-bound forms, and shows considerable similarity in both primary and tertiary 
structure in the nucleotide-binding regions to other guanine nucleotide binding proteins, such 
as p2 1 -ras and EF-Tu. 

k) P horbol Esters/Diacvlglvceml RinHi™ SEQ ID NO: 1 88 and 25 1 represent 
polynucleotides encoding a protein belonging to the family including phorbol 
esters/diacylglycerol binding proteins. Diacylglycerol (DAG) is an important second 
messenger. Phorbol esters (PE) are analogues of DAG and potent tumor promoters that 
cause a variety of physiological changes when administered to both cells and tissues. DAG 
activates a family of serine/threonine protein kinases, collectively known as protein kinase C 
(PKC) (Azzi et al, Eur. J. Biochem. (1992) 205:547). Phorbol esters can directly stimulate 
PKC. The N-terminal region of PKC, known as CI, has been shown (Ono et al Proc Natl 
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Acad. Sci. USA (1989) £6:4868) to bind PE and DAG in a phospholipid and zinc-dependent 

fashion. The CI region contains one or two copies (depending on the isozyme of PKC) of a 

cysteine-rich domain about 50 amino-acid residues long and essential for DAG/PE-binding. 

Such a domain has also been found in, for example, the following proteins. 

(1) Diacylglycerol kinase (EC 2.7.1.107) (DGK) (Sakane et al, Nature (1990) 
344:345), the enzyme that converts DAG into phosphatidate. It contains two copies of the 
DAG/PE-binding domain in its N-terminal section. At least five different forms of DGK are 

known in mammals; and 

(2) N-chimaerin, a brain specific protein which shows sequence similarities with the 
BCR protein at its C-terminal part and contains a single copy of the DAG/PE-binding 
domain at its N-terminal part. It has been shown (Ahmed et al, Biochem. J. (1990) 272:767, 
and Ahmed et al, Biochem. J. (1991) 250:233) to be able to bind phorbol esters. 

The DAG/PE-binding domain binds two zinc ions; the ligands of these metal ions are 
probably the six cysteines and two histidines that are conserved in this domain. The 
signature pattern completely spans the DAG/PE domain. The consensus pattern is: H-x- 
[LIVMFYW]-x(8,ll)-C-x(2)-C-x(3)-[LIVMFC]-x(5,10)-C-x(2)-C-x(4)-[HD]-x(2)-C-x(5,9)- 

C. All the C and H are probably involved in binding zinc. 

1) Protein Kinase. SEQ ID NOS:202, 315, 367, and 397 represent polynucleotides 
encoding protein kinases. Protein kinases catalyze phosphorylation of proteins in a variety of 
pathways, and are implicated in cancer. Eukaryotic protein kinases (Hanks S.K., et al, 
FASEB J. (1995) 9:576; Hunter T., Meth. Enzymol (1991) 200:3; Hanks S.K., et al, Meth. 
Enzymol (1991) 200:38; Hanks S.K., Curr. Opin. Struct. Biol (1991) 7:369; Hanks S.K., et 
al, Science (1988) 247:42) are enzymes that belong to a very extensive family of proteins 
which share a conserved catalytic core common to both serine/threonine and tyrosine protein 
kinases. There are a number of conserved regions in the catalytic domain of protein kinases. 
Two of the conserved regions are the basis for the signature pattern in the protein kinase 
profile. The first region, which is located in the N-terminal extremity of the catalytic 
domain, is a glycine-rich stretch of residues in the vicinity of a lysine residue, which has 
been shown to be involved in ATP binding. The second region, which is located in the 
central part of the catalytic domain, contains a conserved aspartic acid residue which is 
important for the catalytic activity of the enzyme (Knighton D.R., et al, Science (1991) 
253:407). The protein kinase profile includes two signature patterns for this second region: 
one specific for serine/threonine kinases and the other for tyrosine kinases. A third profile is 
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based on the alignment in (Hanks S.K., et al, FASEBJ. (1995) 9:576) and covers the entire 

catalytic domain. The consensus patterns are as follows: 

1) Consensus pattern: [LIV]-G-{P}-G-{P}-[FYWMGSTNH]-[SGA]-{PW}- 
[LIVCAT]-{PD}-x-[GSTACLIVMFY]-x(5,18)-[LIVMFYWCSTAR]-[AIVP]- 
[LIVMFAGCKR]-K, where K binds ATP. The majority of known protein kinases are 
detected by this pattern. Proteins kinases that are not detected by this consensus include 
viral kinases, which are quite divergent in this region and are completely missed by this 
pattern. 

2) Consensus pattern: [LIVMFYC]-x-[HY]-x.D-[LIVMFY]-K-x(2)-N- 
[LIVMFYCT](3), where D is an active site residue. This consensus sequence identifies most 
serine/threonine-specific protein kinases with only 10 exceptions. Half of the exceptions are 
viral kinases, while the other exceptions include Epstein-Barr virus BGLF4 and Droso P hi!a 
ninaC, which have Ser and Arg, respectively, instead of the conserved Lys. These latter two 
protein kinases are detected by the tyrosine kinase specific pattern described below. 

3) Consensus pattern: [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-[RSTAC]-x(2)-N- 
[LIVMFYC], where D is an active site residue. All tyrosine-specific protein kinases are 
detected by this consensus pattern, with the exception of human ERBB3 and mouse blk. 
This pattern also detects most bacterial aminoglycoside phosphotransferases (Benner S., 
Nature (1987) 329:21; Kirby R., J. Mol Evol (1992) 50:489) and herpesviruses ganciclovir 
kinases (Littler E., et al., Nature (1992) 555:160), which are structurally and evolutionary 
related to protein kinases. 

The protein kinase profile also detects receptor guanylate cyclases and 2-5A- 
dependent ribonucleases. Sequence similarities between these two families and the 
eukaryotic protein kinase family have been noticed previously. The profile also detects 
Arabidopsis thaliana kinase-like protein TMKL1 which seems to have lost its catalytic 
activity. 

If a protein analyzed includes the two of the above protein kinase signatures, the 
probability of it being a protein kinase is close to 100%. Eukaryotic-type protein kinases 
have also been found in prokaryotes such as Myxococcus xanthus (Munoz-Dorado J., et al, 
Cell (1991) 67:995) and Yersinia pseudotuberculosis. The patterns shown above has been 
updated since their publication in (Bairoch A., et al, Nature (1988) 557:22). 

m) Protein Phosphatase ?C„ SEQ ID N0.256 corresponds to a polynucleotide 
encoding a novel protein phosphatase 2C (PP2C), which is one of the four major classes of 
mammalian serine/threonine specific protein phosphatases. PP2C (Wenk et al FEBSLett. 
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(1992) 297:135) is a monomeric enzyme of about 42 Kd which shows broad substrate 
specificity and is dependent on divalent cations (mainly manganese and magnesium) for its 
activity. Three isozymes are currently known in mammals: PP2C-alpha, -beta and -gamma, 
n) Protein Tyrosine Phosphatase. SEQ ID NO:382 represents a polynucleotide 
5 encoding a protein tyrosine kinase. Tyrosine specific protein phosphatases (EC 3.1 .3.48) 
(PTPase) (Fischer et al, Science (1991) 253:401; Charbonneau et al.,Annu. Rev. Cell Biol. 
(1992) 5:463; Trowbridge, J. Biol. Chem. (1991) 255:23517; Tonks et al, Trends Biochem. 
Sci. (1989) 14:497; and Hunter, Cell (1989) 55:1013) catalyze the removal of a phosphate 
group attached to a tyrosine residue. These enzymes are very important in the control of cell 
10 growth, proliferation, differentiation and transformation. Multiple forms of PTPase have 
been characterized and can be classified into two categories: soluble PTPases and 
transmembrane receptor proteins that contain PTPase domain(s). 

Soluble PTPases include PTPN3 (HI) and PTPN4 (MEG), enzymes that contain an 
N-terminal band 4.1 -like domain and could act at junctions between the membrane and 
15 cytoskeleton; PTPN6 (PTP-1C; HCP; SHP) and PTPN1 1 (PTP-2C; SH-PTP3; Syp), 
enzymes that contain two copies of the SH2 domain at its N-terminal extremity. 

Dual specificity PTPases include DUSP1 (PTPN10; MAP kinase phosphatase- 1; 
MKP-1) which dephosphorylates MAP kinase on both Thr-183 and Tyr-185; and DUSP2 
(PAC-1), a nuclear enzyme that dephosphorylates MAP kinases ERK1 and ERK2 on both 
20 Thr and Tyr residues. 

Structurally, all known receptor PTPases are made up of a variable length 
extracellular domain, followed by a transmembrane region and a C-terminal catalytic 
cytoplasmic domain. Some of the receptor PTPases contain fibronectin type III (FN-III) 
repeats, immunoglobulin-like domains, MAM domains or carbonic anhydrase-like domains 
25 in their extracellular region. The cytoplasmic region generally contains two copies of the 

PTPAse domain. The first seems to have enzymatic activity, while the second is inactive but 
seems to affect substrate specificity of the first. In these domains, the catalytic cysteine is 
generally conserved but some other, presumably important, residues are not. 

PTPase domains consist of about 300 amino acids. There are two conserved 
30 cysteines and the second one has been shown to be absolutely required for activity. 

Furthermore, a number of conserved residues in its immediate vicinity have also been shown 
to be important. The consensus pattern for PTPases is: [LIVMF]-H-C-x(2)-G-x(3)-[STC]- 
[STAGP]-x-[LIVMFY]; C is the active site residue. 

99 



BNSDOCID: <WO 9933982A2_L> 



WO 99/33982 PCT/US98/27610 
o) SH3 Domain. SEQ ID NO.306 and 386 represent polynucleotides encoding SH3 

domain proteins. The Src homology 3 (SH3) domain is a small protein domain of about 60 
amino acid residues first identified as a conserved sequence in the non-catalytic part of 
several cytoplasmic protein tyrosine kinases (e.g. Src, Abl, Lck) (Mayer et al, Nature (1988) 
332:272). The domain has also been found in a variety of intracellular or membrane- 
associated proteins (Musacchio et al, FEBSLett. (1992) 507:55; Pawson et al., Curr. Biol. 
(1993) 5:434; Mayer et al, Trends Cell Biol (1993) 5:8; and Pawson et al, Nature (1995) 
575:573). 

The SH3 domain has a characteristic fold that consists of five or six beta-strands 
arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain 
short helices (Kuriyan et al, Curr. Opin. Struct. Biol (1993) 5:828). It is believed that SH3 
domain-containing proteins mediate assembly of specific protein complexes via binding to 
proline-rich peptides (Morton et al, Curr. Biol (1994) 4:615). In general, SH3 domains are 
found as single copies in a given protein, but there is a significant number of proteins with 
two SH3 domains and a few with 3 or 4 copies. 

SH3 domains have been identified in, for example, protein tyrosine kinases, such as 
the Src, Abl, Bkt, Csk and ZAP70 families of kinases; mammalian phosphatidylinositol- 
specific phospholipase C-gamma-1 and -2; mammalian phosphatidyl inositol 3-kinase 
regulatory p85 subunit; mammalian Ras GTPase-activating protein (GAP); mammalian Vav 
oncoprotein, a guanine nucleotide exchange factor of the CDC24 family; Drosophila 
lethal(l)discs large-1 tumor suppressor protein (gene Dlgl); mammalian tight junction 
protein ZO-1 ; vertebrate erythrocyte membrane protein p55; Caenorhabditis elegans protein 
lin-2; rat protein CASK; and mammalian synaptic proteins SAP90/PSD-95, CHAPSYN- 
1 10/PSD-93, SAP97/DLG1 and SAP102. Novel SH3-domain containing polypeptides will 
facilitate elucidation of the role of such proteins in important biological pathways, such as 
ras activation. 

p) Trypsin. SEQ ID NO. l 69 corresponds to a novel serine protease of the trypsin 

family. The catalytic activity of the serine proteases from the trypsin family is provided by a 

charge relay system involving an aspartic acid residue hydrogen-bonded to a histidine, which 

itself is hydrogen-bonded to a serine. The sequences in the vicinity of the active site serine 

and histidine residues are well conserved in this family of proteases (Brenner S., Nature 

(1988) 554:528). Proteases known to belong to the trypsin family include: 1) Acrosin; 2) 

Blood coagulation factors VII, DC, X, XI and XII, thrombin, plasminogen, and protein C; 3) 

Cathepsin G; 4) Chymotrypsins; 5) Complement components Clr, Cls, C2, and complement 

100 



PCT/US98/27610 

facTrsTo and I; 6) Complement-activating component of RA-reactive factor; 7) Cytotoxic 
cell proteases (granzymes A to H); 8) Duodenase I; 9) Elastases 1, 2, 3A, 3B (protease E), 
leukocyte (medullasin).; 10) Enterokinase (EC 3.4.21 .9) (enteropeptidase); 1 1) Hepatocyte 
growth factor activator; 12) Hepsin; 13) Glandular (tissue) kallikreins (including EGF- 
5 binding protein types A, B, and C, NGF-gamma chain, gamma-renin, prostate specific 

antigen (PSA) and tonin); 14) Plasma kallikrein; 15) Mast cell proteases (MCP) 1 (chymase) 
to 8- 16) Myeloblasts (proteinase 3) (Wegener's autoantigen); 17) Plasminogen activators 
(urokinase-type, and tissue-type); 18) Trypsins I, II, III, and IV; 19) Tryptases; 20) Snake 
venom proteases such as ancrod, batroxobin, cerastobin, flavoxobin, and protein C activator; 
10 21) Collagenase from common cattle grub and collagenolytic protease from Atlantic sand 
fiddler crab; 22) Apolipoprotein(a); 23) Blood fluke cercarial protease; 24) Drosophila 
trypsin like proteases: alpha, easter, snake-locus; 25) Drosophila protease stubble (gene sb); 
and 26) Major mite fecal allergen Der p III. All the above proteins belong to family SI in 
the classification of peptidases (Rawlings N.D., et ai, Meth. Enzymol. (1994) 2^:19; 

15 ^•//^.vn aS v.ch^r-^^-^ ? " e " tidas tx ^ ori e inate fr ° m eUkary ° tiC SPCCieS - h 
should be noted that bacterial proteases that belong to family S2A are similar enough m the 
regions of the active site residues that they can be picked up by the same patterns. 

The consensus patterns for this trypsin protein family are: 1) [LIVM]-[ST]-A- 
[STAG]-H-C, where H is the active site residue. All sequences known to belong to this class 

20 detected by the pattern, except for complement components Clr and Cls, pig plasminogen, 
bovine protein C, rodent urokinase, ancrod, gyroxin and two insect trypsins; 2) 
[DNSTAGC]-[GSTAPIMVQH]-x(2)-G-[DE]-S-G-[GS]-[SAPHV]-[LIVMFYWH]- 

[LIVMFYSTANQH], where S is the active site residue. All sequences known to belong to 
this family are detected by the above consensus sequences, except for 18 different proteases 
25 which have lost the first conserved glycine. If a protein includes both the serine and the 
histidine active site signatures, the probability of it being a trypsin family serine protease is 
100%. 

^mrv^n G-Beta Repeats. SEQ ID NOS: 188 and 335 represent novel 
members of the WD domain/G-beta repeat family. Beta-transducin (G-beta) is one of the 
30 three subunits (alpha, beta, and gamma) of the guanine nucleotide-binding proteins (G 
proteins) which act as intermediaries in the transduction of signals generated by 
transmembrane receptors (Oilman, Annu. Rev. Biochem. (1987) 56:615). The alpha subunit 
binds to and hydrolyzes GTP; the functions of the beta and gamma subunits are less clear but 
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ft=y seen, * be reared for toe replacement of GDP by G 1Pas weil as for SZT*" 

anchoring and receptor recognition. 

In higher eukaryotes, G-beta exists as a small multigene family of highly conserved 
protems of about 340 amino acid residues. Structurally, G-beta consists of eight tandem 
> repeats of about 40 residues, each containing a central Trp-Asp motif (this type of repeat is 
sometimes called a WD-40 repeat). Such a repetitive segment has been shown to exist in a 
number of other proteins including: human LIS1, a neuronal protein involved in type-1 
hssencephaly; and mammalian coatomer beta' subunit (beta'-COP), a component of a 
cytosolic protein complex that reversibly associates with Golgi membranes to form vesicles 
that mediate biosynthetic protein transport. 

The consensus pattern for the WD domain/G-Beta repeat family is: [LIVMSTAC1- 
[LIVMFYWSTAGC]-[LIMSTAG]-[LIVMSTAGC]-x(2)-[DN]-x(2)-[LIVMWSTAC]-x- 

TT tt r-» *~w,^~- . 

L x-^j „ - L ^i^iNj-tJui vmrdi AUUNj. 

r) wnt Family of Developmental Signaling Proline SEQ ID NO: 23, 291 324 330 
341, and 353 correspond to novel members of the wnt family of developmental signaling 
protems. Wnt-1 (previously known as int-1), the seminal member of this family, (Nusse R 
TrendsGene, (1988) 4:291) is a proto-oncogene induced by the integration of the mouse " 
mammary tumor virus. It is thought to play a role in intercellular communication and seems 
to be a stalling molecule important in the development of the central nervous system 
(CNS). The sequence of wnt-1 is highly conserved in mammals, fish, and amphibians Wnt- 

1 was found to be a member of a large family of related proteins (Nusse R., et al Cell 
(1992) 6-P.-1073; McMahon A.P., Trends Genet. (1992) ft l; Moon R.T., BioEssays (1993) 
75:91) that m m mought tQ be devdopmental ^ ^ ^ ^ ^_ 

2 (also known as irp), wnt-3, -3A, -4, -5A, -5B, -6, -7A, -7B, -8, -8B, -9 and -10. At least 
four members of this family are present in Drosophila; one of them, wingless (wg) is 
imphcated in segmentation polarity. All these proteins share the following features 
characteristics of secretory proteins: a signal peptide, several potential N-glycosylation sites 
and 22 conserved cysteines that are probably involved in disulfide bonds. The Wnt proteins 
seem to adhere to the plasma membrane of the secreting cells and are therefore likely to 
signal over only few cell diameters. The consensus pattern, which is based upon a highly 
conserved region including three cysteines, is as follows: C-K-C-H-G-[LIVMT]-S-G-x-C 
All sequences known to belong to this family are detected by the provided consensus pattern 

s) Ww/rspS/WWP Domain-Comaininp Protein^ . SEQ ID NOS: 188, 379 , and 395 
represent polynucleotides encoding a polypeptide in the family of WW/rsp5/WWP domain 
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cSES-*-. ^ WW domain (Bork « al, Trent* Biocnem. ScL (1994) 79:531; 
^etal.,Biochem. Biophys. Res. Commun. (1994) 205:1201; Hofmann al.,FEBS Lett. 
(1995) 558:153; and Sudol et al, FEBS Lett. (1995) 369:61), also known as rs P 5 or WWP), 
was originally discovered as a short conserved region in a number of unrelated proteins, 
among them dystrophin, the gene responsible for Duchenne muscular dystrophy. The 
domain, which spans about 35 residues, is repeated up to 4 times in some proteins. It has 
been shown (Chen et al, Proc. Natl Acad. ScL USA (1995) 92:7819) to bind proteins with 
particular proline-motifs, [AP]-P-P-[AP]-Y, and thus resembles somewhat SH3 domams. It 
appears to contain beta-strands grouped around four conserved aromatic positions, generally 
Trp The name WW or WWP derives from the presence of these Trp as well as that of a 
conserved Pro. It is frequently associated with other domains typical for proteins m signal 

transduction processes. 

Proteins containing the WW domain include: 

1 Dystrophin, a multidomain cytoskeletal protein. Its longest alternatively 
spliced form consists of an N-terminal actin-binding domain, followed by 24 spectrin-like 
repeats, a cysteine-rich calcium-binding domain and a C-terminal globular domam. 
Dystrophins form tetramers and is thought to have multiple functions including involvement 
in membrane stability, transduction of contractile forces to the extracellular environment and 
organization of membrane specialization. Mutations in the dystrophin gene lead to muscular 
dystrophy of Duchenne or Becker type. Dystrophin contains one WW domain C-terminal of 
the spectrin-repeats. 

2. Vertebrate YAP protein, which is a substrate of an unknown senne kinase. It 
binds to the SH3 domain of the Yes oncoprotein via a proline-rich region. This protein 
appears in alternatively spliced isoforms, containing either one or two WW domams. 

3 IQGAP, which is a human GTPase activating protein acting on ras. It 
contains an N-terminal domain similar to fly muscle m P 20 protein and a C-terminal ras 

GTPase activator domain. 

For the sensitive detection of WW domains, the profile spans the whole homology 
region as well as a pattern. The consensus for this family is: W-x(9,ll)-[VFY]-[FYW]- 
) x(6,7)-[GSTNE]-[GSTQCR]-[FYW]-x(2)-P. 

t) 7 ;~^n pP r C2H2Tvt>e. SEQ ID NO:61, 306, and 386 correspond to 
polynucleotides encoding novel members of the of the C2H2 type zinc finger protein family. 
Zinc finger domains (Klug et al, Trends Biochem. Sci. (1987) 72:464; Evans et al, Cell 
(1988) 52 1; Payre * al, FEBS Let, (1988) 23*245; Miller et al, EMBOJ. (1985) 4:1609; 
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and Berg, Proc. Natl. Acad. Sci. USA (1 988) .5:99) are ^.^^Z^ 
structures first identified in the Xenopus transcription factor TFIIIA. These domains have 
since been found in numerous nucleic acid-binding proteins. A zinc finger domain is 
composed of 25 to 30 amino acid residues. Two cysteine or histidine residues are positioned 
5 at both extremities of the domain, which are involved in the tetrahedral coordination of a 
zinc atom. It has been proposed that such a domain interacts with about five nucleotides 

Many classes of zinc fingers are characterized according to the number and positions 
of the h.stidine and cysteine residues involved in the zinc atom coordination. In the first 
class to be characterized, called C2H2, the first pair of zinc coordinating residues are 
10 cysteines, while the second pair are histidines. A number of experimental reports have 
demonstrated the zinc-dependent DNA or RNA binding property of some members of this 
class. 

Mammalian proteins having a C2H2 zipper include (number in parenthesis indicates 
number of zinc finger regions in the protein): basonuclin (6), BCL-6/LAZ-3 (6), erythroid 
krueppel-hke transcription factor (3), transcription factors Spl (3), Sp2 (3), Sp3 (3) and 
Sp(4) 3, transcriptional repressor YY1 (4), Wilms' tumor protein (4), EGRl/Krox24 (3) 
EGR2/Krox20 (3), EGR3/Pilot (3), EGR4/AT133 (4), Evi-1 (10), GLI1 (5), GLI2 (4 + ) ' 
GLI3 (3 + ), HIV-EP1/ZNF40 (4), HIV-EP2 (2), KR1 ( 9+ ), KR2 (9), KR3 (15 + ), KR4 04+) 
KR5 (1 1 + ), HF.12 (6 + ), REX-1 (4), ZfX (13), ZfY (13), Zrp-35 (18), ZNF7 (15), ZNF8 (7) 

20 ZNF35(10),ZNF42/MZF-l(13),ZNF43 (22),ZNF46/Kup(2),ZNF76(7),ZNF91(36) ' 
ZNF133(3). h 

In addition to the conserved zinc ligand residues, it has been shown that a number of 
other positions are also important for the structural integrity of the C2H2 zinc fingers 
(Rosenfeld et al.,J. Biomol. Struct. Dyn. (1993) 77:557) The best conserved position is 
25 found four residues after the second cysteine; it is generally an aromatic or aliphatic residue 
The consensus pattern for C2H2 zinc fingers is: C-x(2,4)-C-x(3)-[LIVMFYWC]- 
x(8)-H-x(3,5)-H. The two C's and two H's are zinc ligands. 

u) ZincFinp er.crHrn^ SEQ ID N0.322 corresponds to a polynucleotide 
encoding a novel member of the zinc finger CCHC family. The CCHC zinc finger protein 
30 family to date has been mostly composed of retroviral gag proteins (nucleocapsid) The 
prototype structure of this family is from HIV. The family also contains members involved 
m eukaryotic gene regulation, such as C. elegans GLH-1. The consensus sequence of this 
family ,s based upon the common structure of an 18-residue zinc finger. 
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W ° 99 ^ 9 1 8 l. p; ^^ r lu^llnnrntease Domain. SEQ ID NO:306 and 395 represent 
polynucleotides encoding novel members of the zinc-binding metalloprotease domain 
protein family. The majority of zinc-dependent metallopeptidases (with the notable 
exception of the carboxypeptidases) share a common pattern of primary structure (Jongeneel 
5 et al, FEBS Lett. (1989) 242:21 1; Murphy et al, FEBS Lett. (1991) 2*9:4; and Bode et al, 
Zoology (1996) 99:237) in the part of their sequence involved in the binding of zinc, and can 
be grouped together as a superfamily, known as the metzincins, on the basis of this sequence 
similarity. Examples of these proteins include: 1) Angiotensin-converting enzyme (EC 
3.4.15.1) (dipeptidyl carboxypeptidase I) (ACE), the enzyme responsible for hydrolyzing 
1 0 angiotensin I to angiotensin II. 2) Mammalian extracellular matrix metalloproteinases 
(known as matrixins) (Woessner, FASEB J. (1991) 5:2145): MMP-1 (EC 3.4.24.7) 
(interstitial collagenase), MMP-2 (EC 3.4.24.24) (72 Kd gelatinase), MMP-9 (EC 3.4.24.35) 
(92 Kd gelatinase), MMP-7 (EC 3.4.24.23) (matrylisin), MMP-8 (EC 3.4.24.34) (neutrophil 
collagenase), MMP-3 (EC 3.4.24.17) (stromelysin-1), MMP-10 (EC 3.4.24.22) 
1 5 (stromelysin-2), and MMP-1 1 (stromelysin-3), MMP-12 (EC 3.4.24.65) (macrophage 
metalloelastase). 3) Endothelin-converting enzyme 1 (EC 3.4.24.71) (ECE-1), which 
processes the precursor of endothelin to release the active peptide. 

A signature pattern which includes the two histidine and the glutamic acid residues is 
sufficient to detect this superfamily of proteins, having the consensus pattern: [GSTALIVN]- 
20 x(2)-H-E-[LIVMFYW]-{DEHRKP}-H-x-[LIVMFYWGSPQ]. The two FTs are zinc 
ligands, and E is the active site residue. 

FxarnP le 4: nifWntial Exnre ^™ nf Polynucleotides of the Invention : Description of 
l ibraries and Detection of Differential Expression 

25 The relative expression levels of the polynucleotides of the invention was assessed in 

several libraries prepared from various sources, including cell lines and patient tissue 
samples. Table 4 provides a summary of these libraries, including the shortened library 
name (used hereafter), the mRNA source used to prepared the cDNA library, the "nickname" 
of the library that is used in the tables below (in quotes), and the approximate number of 

30 clones in the library. 
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Table 4 Description of cDNA Libraries 



Library 
(lib #) 

1 


Description — 


Number of 
Clones in this 

Clu^fprinor 


i 


Kml2L4 

Human Colon Cell Line, High Metastatic Potential (derived 

fromKml2C) 

"High Colon" 


viuam ill 2f 

307133 


Z 


Kml2C — — 

Human Colon Cell Line, Low Metastatic Potential 
Low Colon" 


284755 


3 
4 


MDA-MB-23 1 " — 

Human Breast Cancer Cell Line, High Metastatic Potential- 
micro-metastases in lung 
"High Breast" 


326937 




MCF7 

Human Breast Cancer Cell, Non Metastatic 
"Low Breast" 


1 

3 1 RQ7G 


8 
9 


MV-522 ~ " " 

Human Lung Cancer Cell Line, High Metastatic Potential 
High Lung" 


223620 


12 


UCP-3 ~ — 

Human Lung Cancer Cell Line, Low Metastatic Potential 
"Low Lung" 


312503 




n^^^ ,C ? VaSCUlar endothe, ial cells (HMEC) - Untreated 
PCR (OhgodT) cDNA library 


41938 


13 
14 


Human microvascular endothelial cells (HMEC) - bFGF 
treated ' 

PCR (OligodT) cDNA library 


4? i nn 




Human microvascular endothelial cells (HMEC) - VEGF 
treated 

PCR (OligodT) cDNA library 




15 


Normal Colon - UC#2 Patient " 

PCR (OligodT) cDNA library 
"Normal Colon Tumor Tissue" 




16 
17 


Colon i umor- UC#2 Patient * " 

PCR (OligodT) cDNA library 
"Normal Colon Tumor Tissue" 


3562S 




Liver Metastasis trom Colon 1 umor of UC#2 Patient 
PCR (OligodT) cDNA library 
High Colon Metastasis Tissue" 


36984 


18 

1 


Normal Colon - UC#3 Patient ' " - — 

PCR (OligodT) cDNA library 
'Normal Colon Tumor Tissue" 


36216 


19 ( 
1 


Jolon Tumor- UC#3 Patient 

>CR (OligodT) cDNA library 
'High Colon Tumor Tissue" 


41388 


20 I 
I 

If 


.iver Metastasis trom Colon I umor of UC#3 Patient 

>CR (OligodT) cDNA library 
High Colon Metastasis Tissue" 


30956 
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W ° Tta?KM12L4 and KM12C cell lines are described in Example 1 above. The MDA- 
MB-231 cell line was originally isolated from pleural effusions (Cailleau, J. Natl Cancer. 
Inst. (1974) 53:661), is of high metastatic potential, and forms poorly differentiated 
adenocarcinoma grade II in nude mice consistent with breast carcinoma. The MCF7 cell line 
5 was derived from a pleural effusion of a breast adenocarcinoma and is non-metastatic. The 
MV-522 cell line is derived from a human lung carcinoma and is of high metastatic 
potential. The UCP-3 cell line is a low metastatic human lung carcinoma cell line; the MV- 
522 is a high metastatic variant of UCP-3. These cell lines are well-recognized in the art as 
models for the study of human breast and lung cancer (see, e.g., Chandrasekaran et al, 
10 Cancer Res. (1979) 39:870 (MDA-MB-231 and MCF-7 ); Gastpar et al.,JMed Chem (1998) 
41 -4965 (MDA-MB-23 1 and MCF-7); Ranson et al , Br J Cancer (1998) 77: 1 586 (MDA- 
MB-231 and MCF-7); Kuang et al, Nucleic Acids Res (1998) 26:1 116 (MDA-MB-231 and 
MCF-7); Varki et al, Int J Cancer (1987) 40A6 (UCP-3); Varki et al, Tumour Biol. (1990) 
77:327; (MV-522 and UCP-3); Varki et al, Anticancer Res. (1990) 70:637; (MV-522); 
15 Kelner \t al, Anticancer Res (1995) 75:867 (MV-522); and Zhang et al, Anticancer Drugs 
(1997) 5:696 (MV522)). The samples of libraries 15-20 are derived from two different 
patients (UC#2, and UC#3). 

Each of the libraries is composed of a collection of cDNA clones that in turn are 
representative of the mRNAs expressed in the indicated mRNA source. In order to facilitate 
20 the analysis of the millions of sequences in each library, the sequences were assigned to 
clusters. The concept of "cluster of clones" is derived from a sorting/grouping of cDNA 
clones based on their hybridization pattern to a panel of roughly 300 7bp oligonucleotide 
probes (see Drmanac et al, Genomics (1996) 37(1):29). Random cDNA clones from a 
tissue library are hybridized at moderate stringency to 300 7bp oligonucleotides. Each 
25 oligonucleotide has some measure of specific hybridization to that specific clone. The 
combination of 300 of these measures of hybridization for 300 probes equals the 
"hybridization signature" for a specific clone. Clones with similar sequence will have 
similar hybridization signatures. By developing a sorting/grouping algorithm to analyze 
these signatures, groups of clones in a library can be identified and brought together 
30 computationally. These groups of clones are termed "clusters". Depending on the 

stringency of the selection in the algorithm (similar to the stringency of hybridization in a 
classic library cDNA screening protocol), the "purity" of each cluster can be controlled. For 
example, artifacts of clustering may occur in computational clustering just as artifacts can 
occur in "wet-lab" screening of a cDNA library with 400 bp cDNA fragments, at even the 
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highest Agency. The stringency used in the indentation of cluster her^"' 0 
groups of clones that are in general from the same cDNA or closely related cDNAs CJosely 
related clones can be a result of different lengu, clones of the same cDNA, closely related 
clones from highly related gene families, „, splice variants of the same cDNA 
5 Differential expression for a selected cluster was assessed by first determining the 

number of cDNA clones corresponding to the selected cluster in the firs, notary (Clones in 
1 ), and the determining the number of cDNA clones corresponding to the selected cluster in 
ft. second hbtary (Clones in 2"). Differential expression of the seiected cluster in the firs, 
hbraty relative ,„ the second library is expressed as a W of percent expression betw« n 
10 thetwohtoarie, In general, the "ratio- is calculated by: 1) calculating the percent 
expression of the selected cluster in the firs, library by dividing the number of clones 

corresponding to a selected cluster in the firs, library bv the total mmh *r „f , 

from the firs. Itbrary; 2) calculating toe percent expression of the selected cluster fa the 
second library by dividing the number of clones corresponding to a selected cluster in a 
second Itary by fte tota! number of clones analyzed from the second library; 3) dividing 
the calculated percent expression from the firs, library by the calculated percent expression 
from the second library. If me "number of clones" corresponding to a selected cluster in a 
hbrary is zero, the value is se, a, 1 to aid in calculation. The formula used in calculating the 
raao takes into account the "depth" of each of the libraries being compared, me tota, 
20 number of clones analyzed in each library. 

In general, a polynucleotide is said to be significantly differentially expressed 
between two samples when the ratio value is greater than at least about 2, preferably greater 
than at least about 3, more preferably greater than at least about 5 , where the ratio value is 
calculated using the method described above. The significance of differential expression is 
-5 determmed using a z score test (Zar, Biostatistic*. An^ Prentice ^ ^ ^ 
"Differences between Proportions," pp 296-298 (1 974). 

Tables 5 to 7 (inserted before the claims) show the number of clones in each of the 
above libraries that were analyzed for differential expression. Examples of differentially 
expressed polynucleotides of particular interest are described in more detail below. 

^y^.eotid.s jm am^sg^ in High Mahwtwtfa , p _,„, D „ vt 

Cancer Cells Versus T,nw Metastatic IWt rw,- 
A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high metastatic potential breast cancer tissue and low 
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JStatic breast cancer cells. Expression of these sequences in breast cancer can be 
valuable in determining diagnostic, prognostic and/or treatment information. For example, 
sequences that are highly expressed in the high metastatic potential cells can be indicative of 
increased expression of genes or regulatory sequences involved in the metastatic process. A 
patient sample displaying an increased level of one or more of these polynucleotides may 
thus warrant more aggressive treatment. In another example, sequences that display higher 
expression in the low metastatic potential cells can be associated with genes or regulatory 
sequences that inhibit metastasis, and thus the expression of these polynucleotides in a 
sample may warrant a more positive prognosis than the gross pathology would suggest. 

The differential expression of these polynucleotides can be used as a diagnostic 
marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 

and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential breast cancer cells and low metastatic potential 
breast cancer cells. 
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™, 8 . ^-tiaUyexpressedpo^^ 
cancer vs. low metastatic breast cancer cells 

SEQID 
NO. 



Differentia] Expression 



Cluster 
ID 



Clones in 
I s ' 



Clones in 
2 nd 



Ratio 



9 

42 
52 
62 
65 
66 
68 
114 
123 
144 
172 
178 
214 
219 
223 
258 
317 
v 379 
4 
39 
74 
81 
130 
157 
162 
183 
202 
298 
338 
384 
386 
388 



High Breast > Low Breast (Lib3 > Lib4) 262T 

High Breast > Low Breast (Lib3 > Lib4) 307 
High Breast > Low Breast (Lib3 > Lib4) 1 9 
High Breast > Low Breast (Lib3 > Lib4) 2623 
High Breast > Low Breast (Lib3 > Lib4) 5749 
High Breast > Low Breast (Lib3 > Lib4) 6455 
High Breast > Low Breast (Lib3 > Lib4) 6455 
High Breast > Low Breast (Lib3 > Lib4) 2030 
High Breast > Low Breast (Lib3 > Lib4) 3389 

High Breast > Low Breast (Lib3 > Lib4) 4623 
High Breast > Low Breast (Lib3 > Lib4) 102 

High Breast > Low Breast (Lib3 > Lib4) 368 1 
High Breast > I 

High Breast > Low Breast (Lib3 > Lib4) 3389 

High Breast > Low Breast (Lib3 > Lib4) 1 399 

High Breast > Low Breast (Lib3 > Lib4) 4837 

High Breast > Low Breast (Lib3 > Lib4) 1 577 

High Breast > Low Breast ( Lib3 > Lib4) 260 



j — ■ y^i^F ^ i ^lU*t| 

Low Breast > High Breast (Lib4 > Lib3) — 3706~ 

Low Breast > High Breast (Lib4 > Lib3) 401 6 

Low Breast > High Breast (Lib4>Lib3) 6268 

Low Breast > High Breast (Lib4 > Lib3) 40392 

Low Breast > High Breast (Lib4 > Lib3) 1 3 1 83 

Low Breast > High Breast (Lib4 > Lib3) 54 1 7 

Low Breast > High Breast (Lib4 > Lib3) 9685 

Low Breast > High Breast (Lib4 > Lib3) 733 7 

Low Breast > High Breast (Lib4 > Lib3) 6124 

Low Breast > High Breast (Lib4 > Lib3) 1 037 

Low Breast > High Breast (Lib4 > Lib3) 689 

Low Breast > High Breast (Lib4 > Lib3) 697 

Low Breast > High Breast (Lib4 > Lib3) 4568 

Low Breast > High Breast (Lib4 > Lib3) 5622 



Library Library 


31 


4 


196 


75 


1364 


525 


31 


4 


9 


0 


6 


0 


6 


0 


iZ 


4 


13 


2 


12 


2 


278 


116 


10 


1 


8 


1 


13 


2 


19 


7 


10 


0 


25 


3 


27 


2 


22 


4 



7.561356 

2.549721 

2.534854 

7.561356 

8.780930 

5.853953 

5.853953 

7.805271 

6.341782 

5.853953 

2.338217 

9.756589 

7.805271 

6.341782 

2.648217 

9.756589 

8.130490 

13.17139 



6 

18 

8 

7 

9 

7 

16 

9 

22 

36 

72 

9 

13 



0 
3 
1 

0 
0 
0 
3 
1 

4 

17 

30 

0 

2 



5.637215 
6.149690 
6.149690 
8.199586 
7.174638 
9.224535 
7.174638 
5.466391 
9.224535 
5.637215 
2.170478 
2.459876 
9.224535 
6.662164 



C ancer Cells Vmm I ow M« aa .„v , „ rr ^ nrrr r , „ x 
A number of polynuejeotide sequences have been identified that are differentially 
expressed between cells derived from high mettstatic potentia. .ung cancer .issue and low 
metastatic lung cancer cell, Expression of these sequences in lung cancer tissue can be 
valuable in determining diagnostic, prognostic and/or treatment information. For example 
*quences tha, are high ly expressed in the high metasfctic potential ce,ls are associated can' 
be mchcativ. of increased expression of genes or regulatory sequences involved in the 
meu^tic process. A patient sample displaying an increased level of one or more of these 
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p 0 r y nuSdes may thus warrant more aggressive treatment. In another example, 
sequences that display higher expression in the low metastatic potential cells can be 
associated with genes or regulatory sequences that inhibit metastasis, and thus the express.on 
of these polynucleotides in a sample may warrant a more positive prognosis than the gross 

pathology would suggest. 

The differential expression of these polynucleotides can be used as a diagnostic 
marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 

and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 
expression between high metastatic potential lung cancer cells and low metastatic potential 
lung cancer cells: 



Table 9 

SEQID 
NO. 

400 
9 

34 
42 
62 
74 
106 
119 
361 
369 
371 
379 
395 
135 
154 
160 
260 
308 
323 
349 
381 



Differentially expressed polynucleotides: High metastatic potential lung 
cancer vs. low metastatic lung cancer cells 



Differential Expression 



High Lung > Low Lung (Lib8 > Lib 9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
High Lung > Low Lung (Lib8 > Lib9) 
Low Lung > High Lung (Lib9 > Lib8) 
Low Lung > High Lung (Lib9 > Lib8) 
Low Lung > High Lung (Lib9 > Lib8) 
Low Lung > High Lung (Lib9 > Lib8) 
Low Lung > High Lung (Lib9 > Lib8) 
Low Lung > High Lung (Lib9 > Lib8) 
Low Lung > High Lung (Lib9 > Lib8) 
Low Lung > High Lung (Lib9 > Lib8) 



Cluster 
ID 



Clones in 
1 st 





Library 


14929 


23 


2623 


6 


5832 


5 


307 


79 


2623 


6 


6268 


5 


10717 


8 


8 


1355 


1120 


5 


2790 


6 


8847 


6 


260 


15 


13538 


9 


36313 


30 


5345 


27 


4386 


21 


4141 


27 


15855 


213 


5257 


25 


2797 


14 


2428 


19 



Clones in 


Ratio 


2 nd 




Library 




16 


2.008868 


1 


8.384840 


0 


6.987366 


27 


4.088903 


1 


8.384840 


0 


6.987366 


0 


11.17978 


122 


15.52111 


0 


6.987366 


0 


8.384840 


1 


8.384840 


0 


20.96210 


1 


12.57726 


1 


21.46731 


6 


3.220097 


3 


5.009039 


4 


4.830145 


12 


12.70149 


5 


3.577885 


1 


10.01807 


2 


6.797982 



ExamEleTl ^ly"™^" Diff e^llv ^nressed in Hirh Metastatic Potential Colon 
ron^r fVU* Versus L™ Metastatic Colon Cancer Cells 
A number of polynucleotide sequences have been identified that are differentially 
20 expressed between cells derived from high metastatic potential colon cancer tissue and low 
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— c colon CMCer cells . of (hese ^ - co|on ^ e PC =I . 

valuable „, determining diagnostic, prognostk ^ ^ 

sequences ^ ^ Wghly expressed jn ^ Wgh meiasta(jc Mjis e 

■ncreased expression of g e„es „ r regulatory invo , ved fa « ° f 

^-^d iS p layinga „ increasedlevelofoileormoreofthes nucleoUd P es ^ A 
*us war™, more a „ e _ In ^ exampte _ J 

fences tha, lnhibit me^tasis, md ^ ^ rf ^ * 

sample ma, ™ a more posiUve ^ fte ^ ^ 

Tie differential expression of these polynucleotides can be used as a diagnostic 
.aprognosticrnarte.for risk assessment , patienttreannen, and ,he like ^ 

polynucleotide sequences can also be used in comWn,,,^ ..... 

and/or biochemical markets. m0,eCUIar 

The following table summarizes identified polynucleotides with differential 



SEQID 
NO. 

1 

176 
241 
362 
374 



Differential Expression 



39 

97 

134 

317 

357 



High Co bn > Low Colon (Libl > Lib2) 6660 

High Co on > Low Colon (Libl > Lib2) 3765 

High Co on > Low Colon (Libl > Lib2) 4275 

High Co on > Low Colon (Libl > Lib2) 6420 
High Colon > Low Colon fL ibl > T 6420 



Low uojon > High Colon (L.b2 > Libl) 4016 

Low Colon > High Colon (Lib2 > Libl ) 945 

Low Colon > High Colon (Lib2 > Libl ) 2464 

Low Colon > High Colon (Lib2> Libl) 1577 

Low Colon > High Colon (Lib2 > Libl ) 4309 



Cluster Clones in Clones in Ratio 
ID ] s< 2 nd 

Library Library 

7 0 6.489973 
19 6 2.935940 
I 1 2 5.099264 
I 0 7.417112 

8 0 7.417112 



14 
21 
19 
40 
13 



5 
9 
5 

12 
4 



3.020043 
2.516702 
4.098630 
3.595289 
3.505407 



20 ***** 

a fr""*" C "'" n "p™, P.,,.-, T= T .. 

A number of polynucleotide sequences have been identified ma, are differential 

25 2 , PreSSi °" °" ta - S ~ iD «*» — — can be valuable in 

daermmmg prognostic and/or tieatmen, mformation . For example, sequences 
<-> are taghly expressed in the high metasfctic po te „tiai ce,.s are associated can be 
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indkativf onLreased expression of genes or regulatory sequences involved in the advanced 
disease state which involves processes such as angiogenesis, dedifferentiation, cell 
replication, and metastasis. A patient sample displaying an increased level of one or more of 
these polynucleotides may thus warrant more aggressive treatment. 
5 The differential expression of these polynucleotides can be used as a diagnostic 

marker, a prognostic marker, for risk assessment, patient treatment and the like. These 
polynucleotide sequences can also be used in combination with other known molecular 

and/or biochemical markers. 

The following table summarizes identified polynucleotides with differential 
10 expression between high metastatic potential colon cancer cells and normal colon cells: 

Table 11: Differentially expressed polynucleotides: High metastatic potential colon 
tissue vs. normal colon tissue 

SEQ ID Differential Expression 

NO. 



52 
52 
172 



Colon Tissue of UC#3 (Lib20 > Libl 8) 
High Colon Metastasis Tissue > Normal 
Tissue in UC#2 (Lib 17 > Lib 15) 
High Colon Metastasis Tissue > Normal 
Tissue in UC#2 (Lib 17 > Lib 15) 



Cluster 


Clones in 


Clones in 


Ratio 


ID 




2 nd 






Library 


Library 




19 


10 


0 


11.69918 


19 


13 


2 


6.025646 


102 


65 


22 


2.738930 



15 Example 9: Polynucleotides p ifWnt^llv F.xnressed at Higher Levels in High Colon 
T„™„r PntPntial Patient Ti ^ »* Versus Metastasized Colon Cancer Patient 
Tissue 

A number of polynucleotide sequences have been identified that are differentially 
expressed between cells derived from high tumor potential colon cancer tissue and cells 

20 derived from high metastatic potential colon cancer cells. Expression of these sequences in 
colon cancer tissue can be valuable in determining diagnostic, prognostic and/or treatment 
information associated with the transformation of precancerous tissue to malignant tissue. 
This information can be useful in the prevention of achieving the advanced malignant state 
in these tissues, and can be important in risk assessment for a patient. 

25 The following table summarizes identified polynucleotides with differential 

expression between high tumor potential colon cancer tissue and cells derived from high 
metastatic potential colon cancer cells: 
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T»b,e n: Differentially e „ polynucleotides: H^tumor potentia^Se'" 

vs. metastatic colon tissue 

seq n> Differential Expression Cluster ciones . n im aones ^ 2 d ^ 

52 High Colon Tumor Tissue > Metastasis 19 10 69 L,bra,7 /'^ 

TissueofUC#3(Libl9>Lib20) 5.160829 

119 High Colon Tumor Tissue > Metastasis 8 14 , 

Tissue of UC#3(Libl9>Lib20) 1 10.47124 

172 High Colon Tumor Tissue > Metastasis 102 <n m 

TissueofUC#3(Libl9>Lib20) 3.216168 

EXamplel °' ^Y""cleotides Differentially Exp r essed ,t H j g her I.e W ,« m ^ t.^ 
Potential Colon Cancer Patient v ersus Nnrmal Pat;o „, T .^ |r 

A number of polynucleotide sequences have been identified that are differentially 
~ u * UCT,Ve ° Irom 1U S |1 n™ 01 - ^ntial colon cancer tissue and normal 
tissue. Expression of these sequences in colon cancer tissue can be valuable in determining 
Agnostic, prognostic and/or treatinen, information associated with the prevention of 
achtevmg the malignant stittt in these tissues, and can be important in risk assessment for a 
patient. For example, sequences ma, are high]y expressed in the potential colon cancer cells 
are associated with or can be indicative of increased expression of genes or regulatory 
sequences involved in early tumor progression. A patient sample displaying an increased 
level of one or more of these polynucleotides may thus warrant closer attention or more 
frequent screening procedures to catch the malignant state as early as possible. 

The following table summarizes identified polynucleotides with differential 
expression between high metastittic potential colon cancer cells and normal colon cells: 

Table 13: Differentially expressed polynucleotides: High tumor potential colon tissue 

vs. normal colon tissue 

S IT Expression cl „ s( . r Clones to Clenesi. Ratio 

ID l*' 2 nd 

52 High Colon Tumor Tissue > Normal Tissue 19 Jf"™* 

ofUC#2(Libl6>Libl5) ™ ail,ssue 19 13 2 6.255508 

288 High Colon Tumor Tissue > Normal Tissue 1267 7 

ofUC#2(Lib!6>Libl5) rmallIssue 12 67 7 0 6.125253 

52 High Colon Tumor Tissue > Normal Tissue 19 ao a 

ofUC#3(Lib!9>Libl8) 69 0 60.37750 

High Colon Tumor Tissue > Normal Tissue 8 ia 

ofUC#3(Libl9>Libl8) 111,31 1,58116 8 14 1 12 . 25 050 

High Colon Tumor Tissue > Normal Tissue 102 ai n 
ofUC#3(Libl9>Libl8) 43 7 5.375222 
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Exanarteli: ^'v""^tides Di f^ntisllv Expressed Across Multiple Libraries 

A number of polynucleotide sequences have been identified that are differentially 
expressed between cancerous cells and normal cells across all three tissue types tested (i.e. , 
breast, colon, and lung). Expression of these sequences in a tissue or any origin can be 
valuable in determining diagnostic, prognostic and/or treatment information associated with 
the prevention of achieving the malignant state in these tissues, and can be important m nsk 
assessment for a patient. These polynucleotides can also serve as non-tissue specific 
markers of, for example, risk of metastasis of a tumor. The following table summarizes 
identified polynucleotides that were differentially expressed but without tissue type- 
specificity in the breast, colon, and lung libraries tested. 



Table 14: Polynucleotides Differentially Expressed Across 
Differential Expression Cluster 



SEQID 
NO. 



39 



42 



52 



62 



74 



Multiple Library Comparisons 
Clones in Clones in Ratio 

Library Library 



High Breast > Low Breast (Lib3 > Lib4) 
High Lung > Low Lung (Lib8 > Lib9) 



2623 
2623 



Low Breast > High Breast (Lib4 > Libi) 
Low Colon > High Colon (Lib2 > Libl) 
High Breast > Low Breast (Lib3 > Lib4) 
High Lung > Low Lung (Lib8 > Lib9) 
High Breast > Low Breast (Lib3 > Lib4) 
High Colon Metastasis Tissue > Normal 
Colon Tissue of UC#3 (Lib20 > Libl 8) 
High Colon Metastasis Tissue > Normal 
Tissue in UC#2 (Libl 7 > Libl 5) 
High Colon Tumor Tissue > Metastasis 
Tissue of UC#3 (Libl9 > Lib20) 
High Colon Tumor Tissue > Normal Tissue 
ofUC#2(Libl6>Libl5) 
High Colon Tumor Tissue > Normal Tissue 

ofUC#3(Libl9>Libl8) r __ 

High Breast > Low Breast (Lib3 > Lib4) 
High Lung > Low Lung (Lib8 > Lib9) 



4016 
4016 



31 

6 

6 



14 



307 
307 
19 



196 
79 



19 
19 
19 

19 



19 



1364 
10 

13 

69 

13 

69 



High Lung > Low Lung (Lib8 > Lib9) 
Low Breast > High Breast (Lib4 > Lib3) 



6268 
6268 



8 



7T9 High Colon Tumor Tissue > Metastasis 

Tissue of UC#3 (Libl9 > Lib20) 
High Colon Tumor Tissue > Normal Tissue 
ofUC#3(Libl9>Libl8) 
High Lung > Low Lung (Lib8 > Lib9) 8 



5 

18 
14 



8 



14 



4 
1 



0 
5 



75 
27 



~yj2 High Breast > Low Breast (Lib3 > Lib4) 
High Colon Metastasis Tissue > Normal 
Tissue in UC#2 (Libl 7 > Libl 5) 
High Colon Tumor Tissue > Metastasis 



102 
102 

102 



1355 
278 



65 



43 



525 
0 

2 

10 

2 

0 

T 

_i_ 
o 

2_ 
i 



i 



122 
116 



22 



10 



7.561356 
8.384840 



6.149690 
3.020043 



2.549721 
4.088903 
2.534854 
11.69918 

6.025646 

5.160829 

6.255508 

60.37750 



6.987366 
6.149690 
10.47124 



12.25050 



15.52111 



2.338217 
2.738930 

3.216168 
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SEQID Differential F*nr M ««» W PCT/US98/27610 

NQ "rental Express.cn CIuster c , ones fa 

ID l 5f 2 nd 

Tissue of UC#3 (Lib 1 9 > Lib20) Library Library 

High Colon Tumor Tissue > Normal Tissue 107 a-x 

QfUC*3(Libl9>Libl8) 2 43 7 5.375222 

317 HTgn ereast > Low Breast (L7b3^Lib4) 1577 25 5 

^=5 ^gw goton > High Colon (Lib2 > I.ihn 577 f n 3 8.130490 

™ 12_ 3 I 595289_ 

High Lung > Low Lung (Lib8>Lib9) 260 i l I ".17139 

7 ^ou 15 o 20.96210 

The cDNA libraries described herein were also analyzed to identify those 

polynucleotides that were specifically expressed in colon cells or tissue /. the 
polynucleotides were identified in libraries prepared from colon cell li nes „' tissu , but M 
in hbranes of breast or lung origin. The nnivn.-.^.. 

I inpa ... ' ' " r " J w " u » u «"were expressed in a colon cell 

line and/or in colon tissue, but were nre^nt in ^ u . , 

. • P CSent m 1116 breast or Iun 8 cDNA libraries described 

herein, are shown in Table 1 5 aescnoed 



10 



Table 15 Polynucleotides specifically expressed in colon cells. 
SEQID Cluster Clones in Clones in «rn m 

NO. l'« Library 2- Ubrary NO C,USter <?'° nesin 

5 36535 2 0 ^o^ 0 ' , 1 Library 2 nd Library 

13 27250 2 0 ™ 39648 2 0 ^ 

19 16283 3 0 S] 85064 1 0 

24 16918 4 0 lit 39391 2 0 

% 40108 2 0 2 4 3 2 6 39498 2 0 

32 32663 1 , f 42 2113 3 n 

43 39833 2 0 oV, 19255 2 0 

47 18957 3 0 ?S 22814 3 0 

48 39508 2 0 ' S 39563 2 ° 
56 7005 8 2 ?S 39420 2 0 

58 18957 3 0 5? 39412 2 0 

59 18957 .3 0 Si 38085 2 0 

60 16283 3 0 ?2 40054 1 0 
64 13238 4 j 39423 2 0 

70 39442 2 0 ?S 39453 2 0 

71 17036 4 J 2 '° 78091 1 0 
73 7005 8 2 ?77 f 168 2 0 
83 11476 6 0 S 39458 2 0 
86 39425 2 J III 14391 3 , 
94 21847 2 , 2? ? 39,95 2 0 

100 16731 3 2 ! 2 12977 s 0 

101 12439 4 0 14391 3 1 
113 17055 4 0 2? 16347 4 0 

120 67907 1 0 ?oa 39478 2 0 

121 12081 4 0 39392 2 0 
124 39174 2 0 £ 39180 J 
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SFO TD 


Cluster 


Clones in 


NO. 




1 st Library 


1 A>U 


8210 


2 


128 


40455 


2 


HQ 

1 J s 


22195 


3 


141 


86859 


1 




8672 


4 


153 


16977 


4 


156 


17036 


4 


159 


40044 


2 


161 


40044 


.2 


161 


22155 


3 


166 


15066 


4 


170 


11465 


5 


176 


3765 


19 


1X1 

1 O 1 


86110 


1 


1 87 


39648 


2 


1 85 


17076 


4 


1 86 


22794 


2 


1 87 
1 o / 


39171 


2 


104 


40455 


2 


1Q0 


16317 


3 


910 


39186 


2 


211 


40122 


2 


218 


26295 


2 


222 


4665 


5 


226 


82498 


1 


227 


35702 


2 



10 



6 

0 

0 

0 

4 

0 

0 

0 

0 

0 

0 

0 

6 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

9 

0 

0 



SEQID 


Cluster 


NO. 




301 


41633 


302 


23218 


303 


39380 


309 


84328 


314 


14367 


320 


39886 


324 


9061 


327 


16653 


328 


16985 


329 


12977 


330 


9061 


333 


16392 


342 


39486 


344 


6874 


345 


6874 


353 


11494 


354 


17062 


355 


16245 


356 


83103 


358 


13072 


366 


14364 


368 


84182 


372 


56020 


389 


7514 


391 


7570 


393 


23210 



PCTAJS98/27610 
Clones in Clones in 
1 st Library 2 nd Library 

1 1 
3 0 

2 0 
1 o 

3 0 
0 
2 
1 



2 

5 

3 

4 

5 

5 

3 

2 

6 

6 

4 

3 

4 
1 

4 
1 
1 
1 
5 
5 
3 



0 
0 
2 
0 
0 
3 
3 
0 
0 
0 
0 

1 

0 
0 
0 
3 
3 
0 



In addition to the above, SEQ ID NOS:159 and 161 were each present in one clone m 
each of Libl6 (Normal Colon Tumor Tissue), and SEQ ID NOS:344 and 345 were each 
present in one clone in Libl7 (High Colon Metastasis Tissue). No clones correspondmg to 
the colon-specific polynucleotides in the table above were present in any of Libranes 3, 4, * 
or 9. The polynucleotide provided above can 
find particular use in reference arrays, as described above 



be used as markers of cells of colon origin, and 



15 



F. xamP le 13: T,w;fWinn of Con tinuous Seq uences Hwin P n Polynucleotide of the 
Invention 

The novel polynucleotides were used to screen publicly available and proprietary 
databases to determine if any of the polynucleotides of SEQ ID NOS .1-404 would facilitate 
identification of a contiguous sequence, e.g., the polynucleotides would provide sequence 
that would result in 5' extension of another DNA sequence, resulting in production of a 
longer contiguous sequence composed of the provided polynucleotide and the other DNA 
sequence(s). Contiging was performed using the AssemblyLign program with the following 
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peters: ,) Overlap: Minimum Overlap Ungth: 30; % T ttngency: 50; SZT" 
Repeat Length: 3„ ; Alignmenl: gap crealion ^ , ^ gap ^ ^ 

Consensus: % Base designation threshold: 80. 

Using these parameters, 44 polynucleotides provided contiged sequences. TKese 
conttged seouen ees are provided as SEQ ID NOS:80,-844. The contiged sequences can be 

orreiated wtth the sequences of SEQ ID NO S:I -404 upon which the eontiged sentences ar* 
b*ed by ld e„,i fy i ng those sequences of SEQ ID NOS:,-404 and the contigtd sequenlsT 
SE QI O N OS: 8 0 1 ,44 ta share.e S a m ec,„ n e n a memTable , .a^,.™* 

,m,n C ° nt18 USm8 1118 ° VerlaP SetttogS h — (Stringency/Overiap). 
SEQ D NO :804 (30W0); SEQ ,D NO: 81 0 (20V20); SEQ ,D NO:812 (30W10)- SEQ ID 
NO:8i4 (40V20); SEQ ID NO:816 (30V.0): SEO m wv„, ™,, J. _ „ 
<20W20);SEQIDNO-841f40W201 T„ " ,• ,. — ""•"•-V.u, W :840 

- V U.S41 (40/4/20). To generals, the indicated polynucleotides did not 
contigusmganunimum20%struisencv 10nv,rl=„ -ru 

then, lm h„ ,a ™ ge " Cy ' 10 overla P- There was a corresponding increase in 

the number of degenerate codons in these sequences. 

The contiged sequences (SEQ ID NO-801 Rd4Wh„. . . , 

v iNu.sui-844) thus represent longer sequences that 

encompass a polynucleotide sequence of the invention. The contiged sequences were 2 
translated m al, three reading Irames to determine the best alignment ^ Mvidual 
fences using the BLAST programs as described above for SEQ ID NOS:,-404 and the 
^danon sciences SEQ ID NOS:405-800. Again the sequences were masked using the 
XBLASTp ro£ ™ for ^ lowcompWtyasdescribe(iabove . n _ 

Seven, of the contiged sequences were found „ encode polypeptides having characterise s 

^P^fam.hesJanc.orcomprismgaknowr.tocMona. domain (Table ,6, Thus the 
-ention encompasses ftagments, lusions, artd variants of such polynucleotides tha, retain 
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Table 16. Profile hits using contiged sequences 



SEQ 

ID 

NO. 



809 



824 



824 



833 



Sequence Name 



Contig_RTA00000 1 77AF.n. 1 8.3. 
Seq THC123051 



Contig_RTA00000187AF.g.24.1. 
Seq THC 168636 



Contig_RTA0000018 /AF.g.24.1. 
Seq THC 168636 



Contig_RTA00000 1 90AF.j .4. 1 . 
Seq THC228776 



Profile 



ATPases 



homeobox 



MAP kinase kinase 
protein kinase 



Start 
(Stop) 



778 
(1612) 



531 
(707) 



769 
(1494) 



170 
(1010) 



Score 



6040 



12080 



5784 



5027 



833 



Contig_RTA00000190AF.j.4.1. 
Seq THC228776 



protein kinase 



170 
(1010) 



All stop/start sequences are provided in the forward direction. 

5 The profiles for the ATPases (AAA) and protein kinase families are described above 

in Example 2. The homeobox and MAP kinase kinase protein families are described further 
below. 

Hnmeohox domain. The 'homeobox' is a protein domain of 60 amino acids (Gehring 
In: r.,MH P Wk to the Homeobox Genes . Duboule D., Ed., ppl-10, Oxford University Press, 

1 0 Oxford, (1994); Buerglin In: Guidebook to th« Homeobox Genes. pp25-72, Oxford 

University Press, Oxford, (1994); Gehring Trends Biochem. Sci. (1992) 77:277-280; Gehring 
etalAnnu. Rev. Genet. (1986)20:147-173; Schofield Trends Neurosci. (1987) 70:3-6; 
http://copan.bioz.unibas.ch/ homeo.html) first identified in number of Drosophila homeotic 
and segmentation proteins. It is extremely well conserved in many other animals, including 

15 vertebrates. This domain binds DNA through a helix-turn-helix type of structure. Several 
proteins that contain a homeobox domain play an important role in development. Most of 
these proteins are sequence-specific DNA-binding transcription factors. The homeobox 
domain is also very similar to a region of the yeast mating type proteins. These are 
sequence-specific DNA-binding proteins that act as master switches in yeast differentiation 

20 by controlling gene expression in a cell type-specific fashion. 

A schematic representation of the homeobox domain is shown below. The helix- 
tum-helix region is shown by the symbols *H' (for helix), and 'f (for turn). 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxHHHHHHHHmHHHHHHHHHxxxxxxxxxx 

oU 

25 1 
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The pattern detects homeobox sequences 24 residues long and spans |KHdti<^s34^rfL 
homeobox domain. The consensus pattern is as follows: [LIVMFYG]-[ASLVR]-x(2)- 

[LIVMSTACN]-x-[LIVM]-x(4)-[LIV]4RKNQESTAIYj-[LIVFSTNKH]-W-[FYVC]-x- 
[NDQTAH]-x(5)-[RKNAIMW] . 

MAP kinase kinase ( MAPKK ). MAP kinases (MAPK) are involved in signal 
transduction, and are important in cell cycle and cell growth controls. The MAP kinase 
kinase. (MAPKK) are dual-specificity protein kinases which phosphorylate and activate 
MAP kinase, MAPKK homologues have been found in yeast, invertebrates, amphibians, 
and mammals. Moreover, the MAPKK/MAPK phosphorylation switch constitutes a basic 
module activated in distinct pathways in yeast and in vertebrates. MAPKK regulation 
studies have led to the discovery of at least four MAPKK convergent pathways in higher 
organisms. One of these is similar to the yeast pheromone m™- u:.u 

x " f>*«*Tfuj vfinui iilCIUUCS 

the stel 1 protein kinase. Two other pathways require the activation of either one or both of 
the senne/threonine kinase-encoded oncogenes c-Raf-1 and c-Mos. Additionally, several 
studies suggest a possible effect of the cell cycle control regulator cyclin-dependent kinase 1 
(cdc2) on MAPKK activity. Finally, MAPKKs are apparently essential transducers through 
which signals must pass before reaching the nucleus. For review, see, e.g., Biologique Biol 
^'0»3)7ftlM-2<H ; Ni«^ 

Curr Opin Cell Biol (1993) 5:207-13; Dhanasekaran et al, Oncogene (1998) /7:1447- 55 . 

Kiefer et al., Biochem Soc Trans (1997) 25:491-8; and Hill, Cell Signal (1996) 5:533-44 ' 
Those skilled in the art will recognize, or be able to ascertain, using not more than 

routine experimentation, many equivalents to the specific embodiments of the invention 

described herein. Such specific embodiments and equivalents are intended to be 

encompassed by the following claims. 

All publications and patent applications cited in this specification are herein 

incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
admission that the present invention is not entitled to antedate such publication by virtue of 
prior invention. 



120 



10 



PCTAJS98/27610 

W ° Ahhough the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it is readily apparent to 
those of ordinary skill in the art in light of the teachings of this invention that certain changes 
and modifications may be made thereto without departing from the spirit or scope of the 
appended claims. 

Deposit Information : 

The following materials were deposited with the American Type Culture Collection: 
CMCC = (Chiron Master Culture Collection) 



Cell Lines Deposited w ith ATCC 



Cell Line 



KM12L4-A 



Kml2C 



MDA-MB-231 



MCF-7 



Deposit Date 



March 19, 1998 



May 15, 1998 



May 15, 1998 



October 9, 1998 



ATCC Accession No. 



CRL- 12496 



CRL-12533 



CRL-12532 



CRL-12584 



CMCC Accession No. 



11606 



11611 



10583 



10377 
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WO 99/33982 
CDNA Library Du etto 

cDNA Library ESI - ATCC# 
Deposit Date - December 22. 1 QQK 
Clone Name Cluster ID " 

M00001395A.C03 4016 
M00001395A:C03 4016 
M00001449A:D12 3681 
M00001449A:D12 3681 
M00001452A:D08 1120 
M00001452A:D08 H20 

M00001513A.B06 4568 

M00001513A:B06 4568 

M00001517A:B07 4313 

M00001517A.B07 4313 

M00001533A.C11 2428 

M00001533A:C11 2428 

M00001533A:CII 2428 

M00001542A:A09 22113 



PCT/US98/27610 



c a*\ a a * n 



22113 



cDNA Library ES2 - ATCC# 
Deposit Date - December 22. 1998 



Clone Name 
M00001343C.F10 
M00001343C.F10 
M00001343C:F10 
M00001343D:H07 
M00001343D.H07 
M00001343D:H07 
M00001343D:H07 
M00001345A:E01 
M00001345A.E01 
M00001345A:E01 
M00001345A.E01 
M00001347A.B10 
M00001347A:B10 
M00001347A:B10 
M00001353A:G12 
M00001353A:G12 
M00001353A.G12 
MO0OO1353D.D1O 
M00001353D.-D10 
MOOOOI353D:D10 
M00001361A:A05 
M00001361A.A05 
M00001362B.D10 
M00001362B:D10 



Cluster ID 
2790 
2790 
2790 
23255 
23255 
23255 
23255 
6420 
6420 
6420 
6420 
13576 
13576 
13576 
8078 
8078 
8078 
14929 
14929 
14929 
4141 
4141 
5622 
5622 



Sequence Name 

79.Al.sp6:130016.Seq 

RTA00000118A.C.4.1 

RTA00000131A.g.l5.2 

79.El.sp6.130064.Seq 

79.C2.sp6.130041.Seq 

RTA00000118A.p.l5.3 

79.D4.sp6:130055.Seq 

RTA00000122A.d.l5.3 

79.F4.sp6:130079.Seq 

RTA00000122A.n.3.1 

RTA00000123A.1.21.1 

79.A5.sp6:130020.Seq 

RTAOOOOO 123A.1.2 1 . 1 .Seq_THC205063 

79.F5.sp6:130080.Sen 

RTAOOOOO 1 25 A.c.7 1* 



sequence Name 
80.El.sp6:130256.Seq 
RTA000001 77AF.e.2. 1 .Seq_THC22946 1 
RTAOOO0O177AF.e.2.1 
100.Cl.sp6:131446.Seq 
RTAOOOOO 1 77AF.e. 1 4.3 .Seq_THC228776 
80.Fl.sp6:130268.Seq 
RTA000001 77AF.e. 14 3 
172.El.sp6:133925.Seq 
RTA00000177AF.f.l0.3 

RTA00000177AF.f.l0.3.Seq THC226443 
80.Gl.sp6:130280.Seq 

80.D2.sp6:130245.Seq 

100.El.sp6:131470.Seq 

RTA00000177AF.g.l6.1 

80.E3.sp6:130258.Seq 

RTA00000177AR.I.13.1 

172.C3.sp6:133903.Seq 

RTA00000177AF.m.l.2 

80.F3.sp6:130270.Seq 

172.D3.sp6.133915.Seq 

80.B4.sp6:130223.Seq 

RTA00000177AF.p.20.3 

80.D4.sp6:130247.Seq 

RTA00000178AF.a.ll 1 
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BNSOOC1D: <WO 9933982A2_I. 



WO 99/33982 
cDNA Library ES3 - ATCC# 
Deposit Date - December 22, 1998 



PCTYUS98/27610 



Clone Name 
M00001362C:H11 
M00001362C:H11 
M00001362C:H11 
M00001362C:H11 
M00001376B:G06 
M00001376B:G06 
M00001387A:C05 
M00001387A:C05 
M00001412B:B10 
M00001412B:B10 
M00001415A:H06 
M00001415A:H06 
M00001416B:H11 
M00001416B:H11 
M00001429D:D07 
M00001429D:D07 
M00001448D:H01 
M00001448D:H01 

cDNA Library ES4 - ATCC# 
Deposit Date - December 22, 1998 



Cluster ID 


Sequence Name 


945 


RTA00000178AR.a.20.1 


945 


100.E4.sp6:131473.Seq 


945 


80.E4.sp6:130259.Seq 


945 


180.C2.sp6:135940.Seq 


17732 


RTA00000178AR.i.2.2 


17732 


80.B5.sp6:130224.Seq 


2464 


80.D6.sp6:130249.Seq 


2464 


RTA00000178AF.n.l8.1 


8551 


RTA00000179AF.p.21.1 


8551 


80.G7.sp6:130286.Seq 


13538 


80.B8.sp6:130227.Seq 


13538 


RTA000001 80AF.a.24.1 


8847 


80.C8.sp6:130239.Seq 


8847 


RTA000001 80AF.b.l6.1 


40392 


RTA00000180AF.j.8.1 


40392 


80.H9.sp6:130300.Seq 


36313 


80.All.sp6:130218.Seq 


36313 


RTA00000181AF.e.23.1 



Clone Name 
M00001463C:B11 
M00001463C:B11 
M00001470A:B10 
M00001470A:B10 
M00001497A:G02 
M00001497A:G02 
M00001500A-.E11 
M00001500A:E11 
M00001501D:C02 
M00001501D:C02 
M00001501D:C02 
M00001504C:H06 
M00001504C:H06 
M00001504C:H06 
M00001504D:G06 
M00001504D:G06 
M00001504D:G06 
M00001504D:G06 
M00001528A:C04 
M00001528A:C04 
M00001528A:C04 



Cluster ID Sequence Name 

1 9 RTA00000 1 82AF.b.7. 1 

19 89.Dl.sp6:l 30703 .Seq 

1037 89.F2.sp6:130728.Seq 

1037 RTA00000121A.f.8.1 

2623 89.F3.sp6:130729.Seq 

2623 RTA00000183AF.a.6.1 

2623 RTA00000183AF.b.l4.1 

2623 89.A4.sp6:130670.Seq 

9685 RTA00000 1 83 AF.c. 1 1 . 1 .Seq_THC 1 09544 

9685 RTA000001 83 AF.c. 1 1.1 

9685 89-C4.sp6:130694.Seq 

6974 89.F4.sp6:130730.Seq 

6974 RTA00000183AF.d.9.1 

6974 RTA00000 1 83 AF.d.9. 1 .Seq_THC223 129 

6420 173.F5.SP6:134133.Seq 

6420 89.G4.sp6:130742.Seq 

6420 RTA00000 1 83 AF.d. 1 1 . 1 .Seq_THC226443 

6420 RTA00000 1 83 AF.d. 11.1 

35555 89.B6.sp6:130684.Seq 

7337 RTA00000123A.b.l7.1 

35555 184.A5.sp6:135530.Seq 
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WO 99/33982 
cDNA Library ES5 - ATCC# 
Deposit Date - December 22. 1998 



PCT/US98/27610 



Clone Name 
M00001537B:G07 
M00001537B.-G07 
M00001541A:D02 
M00001541A:D02 
M00001544B:B07 
M00001544B:B07 
M00001546A.G11 
M00001546A:G11 
M00001549B:F06 
M00001549B:F06 
M00001556A:F11 
M00001556A:F11 
M00001556A:F11 
M00001556B.C08 
M00001556B:C08 



Cluster ID 

3389 

3389 

3765 

3765 

6974 
6974 

1267 

1267 
4193 
4193 
1577 
1577 
1577 
4386 
4386 



cDNA Library ES6 - ATCC# 
Deposit Date - December 22. 1998 



Clone Name 
M00001563B:F06 
M00001563B.-F06 
M00001571C:H06 
M00001571C:H06 
M00001594B:H04 
M00001594B:H04 
M00001597C.H02 
M00001597C:H02 
M00001624C.F01 
M00001624C:F01 
M00001679A:A06 
M00001679A.A06 
M00001679A:A06 
M00003759B:B09 
M00003759B:B09 
M00003759B.B09 
M00003844C:B11 
M00003844C.B11 
M00003844C.B11 
M00003857A:G10 
M00003857A.G10 



Cluster ID 
102 
102 
5749 
5749 
260 
260 
4837 
4837 
4309 
4309 
6660 
6660 
6660 
697 
697 
697 
6539 
6539 
6539 
3389 
3389 



Sequence Name 

RTA00000183AF.m.l9.1 

89.A8.sp6:130674.Seq 

89.C8.sp6:130698.Seq 

RTA0000O135A.d.l.l 

89.A9.sp6:130675.Seq 

RTA00000184AF.a.l5.1 

89.D9.sp6:130711.Seq 

RTA00000125A.O.5.1 

89.G9.sp6:130747.Seq 

RTA00000184AF.e.l3.1 

173.C9.SP6:134101.Seq 

89.Fll.sp6:130737.Seq 

RTA00000184AF.i.23 1 

RTA00000184AF.j.4.1 

89.Hll.sp6:130761.Seq 



Sequence Name ~ 

RTA00000184AF.O.5.1 

90.Bl.sp6:130871.Seq 

90.El.sp6:130907.Seq 

RTA00000185AF.a.l9.1 

90.D2.sp6:130896.Seq 

RTA00000185ARJ.I2.2 

90.E2.sp6.130908.Seq 

RTA00000185AR.k.3.2 

90.C4.sp6:130886.Seq 

RTA00000186AF.e.22.1 

90.F6.sp6.130924.Seq 

122.B5.sp6.132089.Seq 

RTA00000187AF.h. 15.1 

90.G8.sp6:130938.Seq 

RTA00000188AF.d.6.1 

RTA00000 1 88AF.d.6. 1 .Seq_THC 1 78884 

176.D9.sp6:134556.Seq 

RTA00000189AF.d.22.1 

90.B10.sp6:130880.Seq 

90.All.sp6:130869.Seq 

RTA00000189AF.g.3 1 
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BNSDOCID: <WO 9933982A2J. 



WO 99/33982 
cDNA Library ES7 - ATCC# 



PCT/US98/27610 



Clone Name 

M00003914C:F05 

M00003914C:F05 

M00003922A:E06 

M00003922A:E06 

M00003922A:E06 

MO0003983A:AO5 

M00003983A:A05 

M00004028D:A06 

M00004028D:A06 

M00004031A:A12 

M00004031A:A12 

M00004087D:A01 

M00004087D-.A01 

M00004108A:E06 

M00004108A:E06 

M00004114C:F11 

M00004114C:F11 

M00004114C:F11 



cDNA Library 
Deposit Date 
Clone Name 
M00004146C; 
M00004146C 
M00004146C 
M00004146C: 
M00004157C 
M00004157C 
M00004157C 
M00004172C 
M00004172C: 
M00004172C 
M00004229B 
M00004229B 



ES8 - ATCC# 
Dec ember 22, 1998 
Cluster ID 



Cll 
Cll 
:C11 
Cll 
A09 
A09 
:A09 
:D08 
:D08 
:D08 
:F08 
:F08 



5257 

5257 

5257 

5257 

6455 

6455 

6455 

11494 

11494 

11494 

6455 

6455 



Cluster ID 


Sequence Name 


3900 


99.El.sp6:131278.Seq 


3900 


RTA00000190AF.g.l3.1 


23255 


RTA00000190AF.j-4.1 


23255 


99.Fl.sp6:131290.Seq 


23255 


RTA00000190AF.j.4.1.Seq_THC228776 


9105 


99.C3.sp6:131256.Seq 


9105 


RTA00000191AF.a.21.2 


6124 


RTA00000191AR.e.2.3 


6124 


99.D3.sp6:131268.Seq 




RTA00000191AR.e.ll.2 


9061 


RTA00000191AR.e.ll.3 


6880 


RTA00000191AF.m.20.1 


6880 


99.A5.sp6:131234.Seq 


4937 


99.E5.sp6:131282.Seq 


4937 


RTA00000191AF.p.21.1 


13183 


123.D5.sp6:132305.Seq 


13183 


RTAOOOOO 1 92 AF.a.24. 1 


13183 


99.G5.sp6:131306.Seq 



Sequence Name 
99.B6.sp6:131247.Seq 
177.F5.sp6:134768.Seq 

RTA00000192AF.f.3.1 
RTA00000192AF.f.3.1.Seq_THC2l3833 

RTAOOOOO 1 92AF.g.23 . 1 

99.D6.sp6:131271.Seq 

123.E7.sp6:132319,Seq 

RTA00000192AF.j.6.1 

99.G6.sp6:131307.Seq 

177.E6.sp6:134757.Seq 

RTA00000193AF.D.9.1 

99.C8.sp6: 13 1261. Seq 



cDNA Library ES9 - ATCC# 



Clone Name 

M00001466A:E07 

M00001531A:H11 

M00001531A:H11 

MO0OO1551A-.B10 

M00001551A:B10 

M00001551A:B10 

M00001552A:B12 

M00001552A:B12 

M00001556A-.H01 

M00001586C:C05 

M00001604A:B10 

M00001604A-.B10 

M00003879B:C11 

M00003879B:C11 



Cluster ID 


Sequence Name 


4275 


RTA00000120A.j.l4.1 




89.F6.sp6:130732.Seq 




RTAOOOOO 123A.g. 19.1 


6268 


79.G9.sp6:130096.Seq 


6268 


184.C12.sp6:135561.Seq 


6268 


RTA00000126A.O.23.1 


307 


RTAOOOOO 136A.0.4 .2 


307 


79.C7.sp6:130046.Seq 


15855 


RTA00000184AF.j.l.l 


4623 


RTAOOOOO 1 85 AF.f.4.1 


1399 


79.G8.sp6:130095.Seq 


1399 


RTA00000129A.O.10.1 


5345 


RTA00000189AF.1.19.1 


5345 


90.B12.sp6:130882.Seq 
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WO 99/33982 
cDNA Library ES 1 0 - ATCC# 
Deposit D ate - December 22, 1998 



PCT/US98/27610 



Clone Name 
M00001358C:C06 
'M00001388D:G05 
M00001388D:G05 
M00001394A:F01 
M00001394A:F01 
M00001394A:F01 
M00001429A.H04 
M00001447A:G03 
M00001448D:C09 
M00001448D:C09 
M00001448D:C09 
M00001454D:G03 



Cluster ID 

5832 

5832 

6583 

6583 

6583 

2797 

10717 

8 

8 

8 

689 



cDNA Library ESI 1 - ATCC# 
Deposit Date - December 22. 1998 



Clone Name 
M00003975A:G11 
M00003978B.G05 
M00003978B:G05 
M00004059A:D06 
M00004068B:A01 
M00004068B.-A01 
M00004205D:F06 
M00004205D:F06 
M00004205D.F06 
M00004212B:C07 
M00004223A.-G10 



Cluster ID 

12439 

5693 

5693 

5417 

3706 

3706 



2379 
16918 



cDNA Library ESI 2 - ATCC# 
Deposit Date - December 22, 1998 
Clone Name Cluster ID 

M00004223B:D09 7899 
M00004249D:G12 
M00004251C.G07 
M00004372A:A03 2030 



Sequence Name 

RTA00000177AF.O.4.3 

80.F6.sp6:130273.Seq 

RTA00000178AF.O.23.1 

RTA00000179AF.d.l3.1 

172.B8.sp6:133896.Seq 

80.H6.sp6:130297.Seq 

RTA00000180AF.i.l9.1 

RTA00000181AF.d.l0.1 

80.H10.sp6:130301.Seq 

RTA00000181AF.e.l7.1 

100.Bll.sp6.131444.Seq 

RTA00000181AR.1.22.1 



S^cucnce Nsr"** " " ~ 

RTA00000190AF.O.24.1 

RTA000001 90AF.p. 1 7.2.Seq_THC 1 733 1 8 

RTA00000190AF.p.l7.2 

RTA00000191AF.h.l9.1 

99.C4.sp6:131257.Seq 

RTA00000191AF.U7.2 

99.E7.sp6:131284.Seq 

177.G7.sp6:134782.Seq 

RTA00000192AF.O.11 1 

RTA00000192AF.p.8.1 

RTA00000193AF.a.l6.1 



Sequence Name 

RTA00000193AF.a.l7.1 

RTA00000193AF.C.22.1 

RTA00000193AF.d.2.1 

RTA00000193AF.m.20.1 
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BNSDOCIO: <WO 9933982A2J_> 



WO 99/33982 
cDNA Library ESI 3 - ATCC# 
Deposit Date - December 22, 1998 

Clone Name Cluster ID 

M00001340B:A06 17062 

M00001340B:A06 17062 

M00001340D:F10 H589 

M00001340D-.F10 11589 

M00001341A:E12 4443 

M00001341A:E12 4443 

M00001342B-.E06 39805 

M00001342B:E06 39805 

M00001346A-.F09 5007 

M00001346A-.F09 5007 

M00001346D:G06 5779 

M00001346D:G06 5779 

M00001348B.B04 16927 

MO000 1 348B:B04 1 6927 

M00001348B:G06 16985 

M00001348B:G06 16985 

M00001349B:B08 3584 

M00001349B-.B08 3584 

M00001350A:H01 7187 

M00001350A:H01 7187 

M00001350A:H01 7187 

M00001352A:E02 16245 

M00001352A:E02 16245 

M00001352A-.E02 16245 

M00001355B:G10 14391 

M00001355B:G10 14391 

M00001355B-.G10 14391 

MO00O1355B:G10 14391 

M00001361D:F08 2379 

M00001361D-.F08 2379 

M00001365C:C10 40132 

M00001365C:C10 40132 
M00001368D:E03 
M00001368D:E03 

M00001370A:C09 6867 

M00001370A:C09 6867 

M00001371C-.E09 7172 

M00001371C:E09 7172 

M00001371C:E09 7172 

M00001378B:B02 39833 

M00001378B-.B02 39833 

M00001379A:A05 1334 

M00001379A:A05 1334 

M00001380D:B09 39886 

M00001380D:B09 39886 
M00001381D:E06 
M00001381D-.E06 

M00001382C:A02 22979 

M00001382C:A02 22979 
M00001384B.A1 1 
M00001384B:A11 

M00001386C:B12 5178 



PCT/US98/27610 



Sequence Name 

80.Al.sp6:130208.Seq 

RTA00000177AF.b.8.4 

80.Bl.sp6:130220.Seq 

RTA00000177AF.b.l7.4 

80.Cl.sp6:130232.Seq 

RTA00000177AF.b.20.4 

80.Dl.sp6:130244.Seq 

RTA00000177AF.C.21.3 

RTA00000177AF.g.2.1 

80.Hl.sp6:130292.Seq 

RTA00000 1 77AF.g. 14.3 

RTA00000177AF.g.l4.1 

80.E2.sp6:130257.Seq 

RTA00000177AF.h.9.3 

RTA00000 1 77 AF.h. 10.1 

80.F2.sp6:130269.Seq 

RTA00000 1 77AF .h.20. 1 

80.G2.sp6:130281.Seq 

100.C2.sp6:131447.Seq 

80.A3.sp6:130210.Seq 

RTA00000177AF.i.8.2 

RTA00000177AF.k.9.3 

172.D2.sp6:133914.Seq 

80.D3.sp6:130246.Seq 

RTA00000177AF.m.l7.3 

80.G3.sp6:130282.Seq 

172.H3.sp6:133963.Seq 

100.E3.sp6:131472.Seq 

80.C4.sp6:130235.Seq 

RTA00000 1 78AF.a.6. 1 

RTA00000178AF.C.7.1 

80.F4.sp6:130271.Seq 

80.G4.sp6:130283.Seq 

RTA00000 1 78 AF.d.20. 1 

80.H4.sp6:130295.Seq 

RTA00000178AF.e.l2.1 

10O.A5.sp6:131426.Seq 

RTA00000178AF.f.9.1 

80.A5.sp6:130212.Seq 

80.C5.sp6:130236.Seq 

RTA00000178AF.L23.1 

80.D5.sp6:130248.Seq 

RTA00000178AF.j.7.1 

RTA00000178AF.j.24.1 

80.E5.sp6:130260.Seq 

8O.F5.sp6:130272.Seq 

RTA00000178AF.k.l6.1 

80.G5.sp6:130284.Seq 

RTA00000178AF.k.22.1 

80.B6.sp6:130225.Seq 

RTA00000178AF.m.l3.1 

80.C6.sp6:130237.Seq 
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<WO 9933982A2J_> 



WO 99/33982 
cDNA Library ESI 3 - ATCC# 
Deposit Date - December 22. 1 998 

Clone Name Cluster ID 

M00001386C.B12 5178 

M0000I387B.-G03 7587 

M00001387B:G03 7587 

M00001389A:C08 16269 

M00001389A:C08 16269 

M00001396A:C03 4009 

M00001396A:C03 4009 

M00001396A:C03 4009 
M00001400B:H06 
M00001400B:H06 
M00001400B.H06 
M00001400B:H06 

M00001402A:E08 39563 

M00001402A:E08 39563 

M00001407B.D1 1 5556 

M00001407B.D11 5556 

M000014iOA:D07 7005 

M00001410A:D07 7005 

M00001410A:D07 7005 
M00001414A:B01 
M00001414A.B01 
M00001414C.A07 
M00001414C:A07 

M00001416A:H01 7674 

M00001416A:H01 7674 

M00001417A:E02 36393 

M00001417A:E02 36393 

M00001423B.E07 15066 

M00001423B.E07 15066 

M00001424B.G09 10470 

M00001424B:G09 10470 

M00001425B.H08 22195 

M00001425B:H08 22195 
M00001426B:D12 
M00001426B:D12 

M0OOOI426D.C08 4261 

MO00O1426D:C08 4261 

M00001428A.H10 84182 

M00001428A:HIO 84182 

M00001428A:H10 84182 

M00001449A.A12 5857 

M00001449A.-A12 5857 

M00001449A.B12 41633 

M00001449A:B12 41633 

M00001449A:G10 36535 

M00001449A:G10 36535 

M00001449A.G10 36535 

M00001449C.D06 86110 

M00001449C.D06 86110 

M00001450A.A02 39304 

M00001450A:A02 39304 

M00001450A:A02 39304 



PCT/US98/27610 



sequence Name 

RTA00000178AF.n.l0.1 

80.E6.sp6:130261.Seq 

RTA00000178AF.n.24 1 

RTA00000178AF.p.l.l 

80.G6.sp6:130285.Seq 

172.D8.sp6.133920.Seq 

80.A7.sp6:130214.Seq 

RTA00000179AF.e.20 1 

172.B9.sp6:133897.Seq 

80.B7.sp6:130226.Seq 

RTA00000179AF.j.l3.1 

RTA00000 1 79AF.J. 13.1 .SeqTHC 1 05720 

80.C7.sp6.130238.Seq 

RTA00000179AF.k.20.1 

RTA00000179AF.n.l0.1 

80.D7.sp6: 1 30250.Seq 

180.H5.sp6:136003Tseq 

RTA00000179AF.O.22 1 

80.F7.sp6:130274.Seq 

RTA00000180AF.a.9.1 

80.H7.sp6:130298.Seq 

80.A8.sp6:130215.Seq 

RTA00000180AF.a.ll.l 

79. Cl.sp6:130040.Seq 
RTA00000118A.g.9.1 
RTA00000180AF.C.2.1 

80. D8.sp6:130251.Seq 

RTA00000180AF.e.24.1 

80.H8.sp6:130299.Seq 

80.A9.sp6:130216.Seq 

RTA00000180AF.f.l8.1 

RTA00000180AF.g.7.1 

80.B9.sp6.130228.Seq 

RTA00000180AF.g.22.1 

80.C9.sp6:130240.Seq 

80.D9.sp6:130252.Seq 

RTA00000180AF.h.5.1 

100.G9.sp6:131502.Seq 

RTA00000180AF.h.l9.1 

80.E9.sp6:130264.Seq 

80.Bll.sp6:130230.Seq 

RTA00000118A.g.l4.1 

80.Cll.sp6:130242.Seq 

RTA00000118A.g.I6.1 

RTA00000181AF.f.5.1 

80.Dll.sp6:130254.Seq 

100.Dll.sp6:131468.Seq 

RTA00000181AF.f.l2.1 

80.Ell.sp6:130266.Seq 

RTA00000 1 1 8A.j.2 1 . 1 .SeqTHC 1 5 1 859 
RTA00000118A.J.21.1 
79.Fl.sp6:130076.Seq 
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Clone Name 
M00001450A:A02 
M00001450A:A1 1 
M00001450A:A1 1 
M00001450A:B12 
M00001450A:B12 
M00001450A:B12 
M00001450A:D08 
M00001450A:D08 
M00001450A:D08 
M00001452A:B04 
M00001452A:B04 
M00001452A:B12 
M00001452A:B12 
M00001452A:F05 
M00001452A-.F05 
M00001452C:B06 
M00001452C:B06 
M00001452C-.B06 
M00001453A:E11 
M00001453A-.E11 
M00001453A:E11 
M00001453C:F06 
M00001453C:F06 
M00001454A:A09 
M00001454A:A09 
M00001454B:C12 
M00001454B:C12 
M00001454B:C12 
M00001455B:E12 
M00001455B:E12 
M00001460A:F06 
M00001460A:F06 
M00001461A:D06 
M00001461A:D06 
M00001465A:B11 
M00001465A:B11 
M00001467A:B07 
M00001467A-.B07 
M00001467A:D04 
M00001467A:D04 
M00001467A:E10 
M00001467A:E10 
M00001468A:F05 
M00001468A:F05 
M00001469A:A01 
M00001469A-.A01 
M00001469A-.C10 
M00001469A:C10 
M00001469A:H12 
M00001469A:H12 
M00001470A:C04 
M00001470A:C04 



Cluster ID Sequence Name 

39304 180.G9.sp6:135995.Seq 

32663 80.F1 1 .sp6:130278.Seq 

32663 RTA00000118A.1.8.1 

82498 1 00.F 1 1 .sp6: 1 3 1 492.Seq 

82498 RTA00000118A.m.l0.1 

82498 79.Gl.sp6:130088.Seq 

27250 80.G 1 1 .sp6: 1 30290.Seq 

27250 180.B10.sp6:135936.Seq 

27250 RTA00000181AF.g.l0.1 

84328 RTA00000118A.p.l0.1 

84328 79.A2.sp6:130017.Seq 

86859 RTA000001 1 8A.p.8.1 

86859 79.B2.sp6:130029.Seq 

85064 RTA00000131A.m.23.1 

85064 79.D2.sp6:130053.Seq 

16970 80.Hll.sp6:130302.Seq 

16970 100.C12.sp6:131457.Seq 

16970 RTA00000181AR.I.18.2 

16130 80.A12.sp6:130219.Seq 

16130 100.D12.sp6:131469.Seq 

16130 RTA00000119A.C.13.1 

16653 80.B12.sp6:130231.Seq 

16653 RTA00000181AF.k.5.3 

83 1 03 RTA000001 1 9A.e.24.2 

83103 79.G2.sp6:130089.Seq 
7005 121.Dl.sp6:131917.Seq 

7005 RTA00000181AF.k.24.1 
7005 80.C12.sp6:130243.Seq 
13072 8O.F12.sp6:130279.Seq 
13072 " RTA00000181AR.m.5.2 
2448 89.Al.sp6:130667.Seq 
2448 RTA000001 1 9A.j.2 1 . 1 

1531 89.Cl.sp6:l 30691. Seq 

1 53 1 RTA000001 1 9A.o.3 . 1 

10145 79.F3.sp6:130078.Seq 
10145 RTA00000120A.g.l2.1 
38759 89.Fl.sp6:130727.Seq 
38759 RTA00000120A.m.l2.3 
39508 RTA00000120A.O.2.1 
39508 89.Gl.sp6:130739.Seq 
39442 89.A2.sp6:130668.Seq 
39442 RTA00000120A.O.21.1 
7589 RTA00000120A.p.23.1 
7589 89.B2.sp6:130680.Seq 
RTA00000121A.C.10.1 
89.C2.sp6:130692.Seq 
12081 89.D2.sp6:130704.Seq 
12081 RTA00000133A.d.l4.2 
19105 89.E2.sp6:130716.Seq 
19105 RTA00000133A.e.l5.1 
39425 89.G2.sp6:130740.Seq 
39425 RTA00000133A.f.l.l 
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M00001471A.-B01 39478 

M00001471A:B01 39478 
M00001487B:H06 
M00001487B:H06 
M00001488B:F12 
M00001488B.F1 2 

M00001494D.F06 7206 

M00001494D.F06 7206 

M00001499B:A11 10539 

M00001499B:A11 10539 

M00001499B:A11 10539 

M0O0015OOA:C05 5336 

M00001500A:C05 5336 
M00001504A.E01 
M00001504A:E01 
M00001504A:E01 

M0000i5G4C:AO7 10185 

M00001504C:A07 10185 
M00001505C:C05 
M00001505C:C05 
M00001506D.A09 
M00001506D.-A09 
M00001506D:A09 

M00001507A.H05 39168 

M00001507A:H05 39168 

M00001535A:F10 39423 

M00001535A:F10 39423 

M00001541A:H03 39174 

M00001541A.H03 39174 

M00001544A.G02 19829 

M0000 1 544A:G02 1 9829 

M00001545A:D08 13864 

M00001545A:D08 13864 

M00001551A:F05 39180 

M00001551A:F05 39180 

M0000I552A.D11 39458 

M00001552A:D11 39458 

M00001557A:F03 39490 
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Sequence Name 

89.H2.sp6:130752.Seq 

RTA00000133A.i.5.1 

RTA00000182AF.I.15.1 

89.B3.sp6:l 30681. Seq 

RTA00000182AF.1.20.1 

89.C3.sp6:130693.Seq 

RTA00000182AF.O.15.1 

89.E3.sp6.130717.Seq 

RTA00000183AF.a.24.1 

89.G3.sp6: 1 30741. Seq 

173.B5.SP6:134085.Seq 

RTA00000183AF.b.l3.1 

89.H3.sp6: 130753. Seq 

RTA00000183AF.C.24.1 

89.D4.sp6:130706.Seq 

RTA00000 1 83 AF.c.24. 1 .Sen THr > 2^1 1 

RTA00000183AF.d.5.1 "~ 

89.E4.sp6:130718.Seq 

89.H4.sp6:130754.Seq 

RTA00000183AF.e.l.l 

89.A5.sp6:130671.Seq 

RTA00000183AF.e.23.1 

121.G6.sp6.131958.Seq 

RTA00000121A.1.10.1 

89.B5.sp6:130683.Seq 

79.C5.sp6:130044.Seq 

RTA00000134A.k.22.1 

79.E5.sp6:130068.Seq 

RTA00000124A.n.l3.1 

79.H5.sp6.130104.Seq 

RTA00000125A.h.24.4 

RTA00000125A.m.9.1 

79.B6.sp6: 130033. Seq 

RTA00000126A.n.8.2 

79.A7.sp6:130022.Seq 

RTA00000126A.p.l5.2 

79.D7.sp6:130058.Seq 

RTA00000128A.b.4.1 
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Clone Name Cluster ID 

M00001511A:H06 39412 

M00001511A:H06 39412 

M00001512A:A09 39186 

M00001512A-.A09 39186 

M00001512D:G09 3956 

M00001512D:G09 3956 

M00001512D:G09 3956 
M00001513B-.G03 
M00OO1513B:G03 
M00001513B:G03 

M00001513C:E08 14364 

M00001513C:E08 14364 

M00001514C:D11 40044 

M00001514C:D11 40044 

M00001514C:D11 40044 

M00001518C:B11 8952 

M00001518C:B11 8952 

M00001528B:H04 8358 

M00001528B:H04 8358 

M00001531A:D01 38085 

M00001531A-.D01 38085 

M00001534A:C04 16921 

M00001534A:C04 16921 

M00001534A:D09 5097 

M00001534A:D09 5097 

M00001534C:A01 4119 

M00001534C:A01 4119 

M00001535A:C06 20212 

M00001535A-.C06 20212 

M00001535A:C06 20212 

M00001536A:B07 2696 

M00001536A:B07 2696 

M00001537A-.F12 39420 

M00001537A:F12 39420 

M00001540A-.D06 8286 

M00001540A:D06 8286 

M00001542A:E06 39453 

M00001542A:E06 39453 
M00001544A-.E06 
M00001544A:E06 
M00001544A:E06 
M00001545A-.B02 
M00001545A:B02 

M000O1548A-.E1O 5892 

M00001548A:E10 5892 

M00001548A-.E10 5892 

M00001549C:E06 16347 

M0000 1 549C:E06 1 6347 

M00001550A:A03 7239 

M00001550A:A03 7239 

M00001550A:G01 5175 

M00001550A:G01 5175 
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Sequence Name 

RTA00000133A.k.l7.1 

89.C5.sp6:130695.Seq 

89.D5.sp6:130707.Seq 

RTA00000121A.p.l5.1 

89.E5.sp6:130719.Seq 

173.H5.SP6:134157.Seq 

RTA00000183AF.g.3.1 

RTA00000183AF.g.9.1 

89 F5.sp6:130731.Seq 

RTA00000 1 83 AF.g.9. 1 .Seq_THC 1 98280 

RTA00000183AF.g.l2.1 

89.G5.sp6:130743.Seq 

RTA00000183AF.g.22.1 

RTA00000183AF.g.22.1.Seq_THC232899 

89.H5.sp6:130755.Seq 

89.A6.sp6:130672.Seq 

RTA00000183AF.h.l5.1 

89.D6.sp6:130708.Seq 

RTA00000183AF.i.5.1 

RTA00000123A.e.l5.1 

89.E6.sp6:130720.Seq 

RTA00000183AF.k.6.1 

89.H6.sp6:130756.Seq 

RTA00000134A.k.l.l 

RTA00000 1 34A.k. 1 . 1 .Seq_THC2 1 5869 

RTA00000183AF.k.l6.1 

89.C7.sp6:130697.Seq 

89.E7.sp6:130721.Seq 
RTA00000134A.1.22.1.Seq_THC128232 

RTA00000134A.1.22.1 

RTA00000 1 34A.m . 1 3 . 1 

89.F7.sp6:130733.Seq 

89.H7.sp6:130757.Seq 

RTA00000134A.O.23.1 

89.B8.sp6:130686.Seq 

RTA00000183AF.O.1.1 

89.E8.sp6:130722.Seq 

RTA00000135A.g.H.l 

RTA00000184AF.a.8.1 

173.G7.SP6:134147.Seq 

89.H8.sp6:130758.Seq 

89.B9.sp6:130687.Seq 

RTA00000135A.1.2.2 

89.E9.sp6:130723.Seq 

RTA00000184AF.d.ll-l 

RTA00000184AF.d.l 1 .1 .Seq_THC161896 

89.H9.sp6:130759.Seq 

RTA000001 84AF.e.l 5.1 

89.A10.sp6:130676.Seq 

RTA00000126A.m.4.2 

RTA00000184AF.f.3.1 

89.B10.sp6:130688.Seq 
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M00001551A.-G06 
M00001551A.-G06 
M0O001551C:G09 
M00001551C:G09 
M00001553A:H06 
M00001553A:H06 
M00001553B.F12 
M0000I553B:F12 
M00001555A:B02 
M00001555A.B02 
M00001555A:C01 
MOO0O1555A.C01 
M00001555D:G10 
M00001555D:G10 
M00001556A:C09 
M00001556A:C09 
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iv/rnnnni cc/n.r.A^ 

x»* vv w i ^^UO.VJUZ 

M0000I556B.G02 
M00001557B.H10 
M00001557B:H10 
M00001557B.H10 
M00001557D:D09 
M00001557D.D09 
M00001558B:H11 
M00001558B:H11 
M00001559B:F01 
M00001559B:F01 
M00001560D:F10 6558 
M00001560D.-F10 6558 
M00001566B:D11 
M00001566B.D1 1 
M00001583D:A10 6293 
M00001583D:A10 6293 
M00001590B:F03 
M00001590B:F03 
M00001597D:C05 10470 
M00001597D.C05 10470 
M00001598A:G03 16999 
M00001598A:G03 16999 
M00001601A.D08 22794 
M00001601A:D08 22794 
M00001607A:E11 11465 
M00001607A:E11 11465 
M00001608A:B03 7802 
M00001608A:B03 7802 
M00001608B.E03 22155 
M00001608B.E03 22155 
M00001608D:A11 
M00001608D.A11 
M00001614C:F10 13157 
M00001614C:F10 13157 
M00001617C:E02 17004 



Cluster ID 
22390 
22390 
3266 
3266 
8298 
8298 
4573 
4573 
39539 
39539 
39195 
39195 
4561 
4561 
9244 
9244 
i 1294 
11294 
5192 
5192 
5192 
8761 
8761 
7514 
7514 



Sequence Name 
RTA00000136A.j.l3.1 
89.C10.sp6:130700.Seq 
RTA00000184AR.g.l.l 
89.D10.sp6:130712.Seq 
RTA00000127A.d.l9.1 
89.G10.sp6.130748.Seq 
89.HIO.sp6:130760.Seq 
RTA00000184AF.h.9.1 
RTA00000127A.i.21.1 
89.Bll.sp6:130689.Seq 
89.C1 l.sp6: 130701. Seq 
RTA00000137A.C.16.1 
RTA00000184AF.i.21.1 
89.Dll.sp6:130713.Seq 
89.Ell.sp6:130725.Seq 
RTA00000127A.1.3.1 
RTA000001 84 AF.j.6.1 
89.A12.sp6: 1 30678.Seq 
173.E9.SP6:134125.Seq 
RTA00000184AF.k.2.1 
89.D12.sp6:130714.Seq 
RTA00000184AF.k.l2.1 
89.E12.sp6:130726.Seq 
RTA00000184AF.k.21.1 
89.G12.sp6:130750.Seq 

89. H12.sp6:130762.Seq 
RTA00000184AF.1.11.1 

90. Al.sp6:130859.Seq 
RTA00000184AF.m.21 1 
RTA00000184AF.p.3.1 
90.Dl.sp6:130895.Seq 
RTA00000185AF.e.ll.l 
90.A2.sp6:130860.Seq 
RTA00000185AF.g.ll.l 
9O.C2.sp6:130884.Seq 
RTA00000185AF.k.6.1 
90.F2.sp6:130920.Seq 
90.G2.sp6.130932.Seq 
RTA00000185AF.k.9 1 
RTA00000138A.b.5.1 
90.H2.sp6:130944.Seq 
RTA00000185AF.m.l9.1 
90.A3.sp6:130861.Seq 
RTA00000185AF.n.5.1 
90.B3.sp6.130873.Seq 
RTA00000185AF.n.9.1 
90.C3.sp6:130885.Seq 
RTA00000l85AF.n.l2 1 
90.D3.sp6:130897.Seq 
RTA00000186AF.a.6.1 
90.E3.sp6:130909.Seq 
RTA00000186AF.b.21.1 
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M00001617C:E02 17004 

M00001619C.F12 40314 

M00001619C:F12 40314 

M00001621C-.C08 40044 

M00001621C:C08 40044 

M00001621C:C08 40044 

M00001621C:C08 40044 

M00001623D:F10 13913 

M00001623D-.F10 13913 
M00001632D:H07 
M00001632D:H07 
M00001632D:H07 
M00001632D:H07 

M00001644C:B07 39171 

M00001644C:B07 39171 

M00001644C:B07 39171 

M00001645A:C12 19267 

M00001645A:C12 19267 

M00001645A:C12 19267 

M00001645A-.C12 19267 

M00001648C-.A01 4665 

M00001648C:A01 4665 

M00001657D:C03 23201 

M00001657D:C03 23201 

M00001657D-.F08 76760 

M00001657D:F08 76760 

M00001662C:A09 23218 

M00001662C:A09 23218 

M00001663A-.E04 35702 

M00001663A:E04 35702 

M00001669B:F02 6468 

M00001669B:F02 6468 

M00001670C-.H02 14367 

M00001670C:H02 14367 

M00001673C:H02 7015 

M00001673C:H02 7015 

M00001675A:C09 8773 

M00001675A:C09 8773 

M00001675A:C09 8773 

M00001676B:F05 11460 

M00001676B:F05 11460 

M00001676B:F05 H460 

M00001677D-.A07 7570 

M00001677D:A07 7570 

M00001677D-.A07 7570 

M00001678D:F12 4416 

M00001678D:F12 4416 

M00001679A:F10 26875 

M00001679A:F10 26875 

M00001679B:F01 6298 

M00001679B:F01 6298 

M00001680D:F08 10539 
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Sequence Name 

90.F3.sp6:130921.Seq 

90.G3.sp6:130933.Seq 

RTA00000186AF.C.15.1 

RTA00000186AF.d.l.l _„„ 0 ™ 
RTA00000186AF.d.l.l.Seq_THC232899 

90.H3.sp6:130945.Seq 

122.El.sp6:132121.Seq 

RTA00000186AF.e.6.1 

90.A4.sp6:130862.Seq 

RTA00000 1 86AF.h. 1 4. 1 .Seq_THC 1 1 2525 

RTA00000 1 86AF.h. 14.1 

90.E4.sp6:130910.Seq 

176.A3.sp6:134514.Seq 

RTA00000186AF.1.7.1 

90.F4.sp6:130922.Seq 

217.A12.sp6:139369.Seq 

RTA00000 1 86AF.1. 1 2. 1 .SeqJTHC 1 781 83 

176.G3.sp6:134586.Seq 

RTA00000186AF.1.12.1 

90.G4.sp6:130934.Seq 

90.H4.sp6:130946.Seq 

RTA00000186AF.m.3.1 

RTA000001 87AF.a. 14. 1 

90.B5.sp6:l 30875 .Seq 

90.C5.sp6:130887.Seq 

RTA00000187AF.a.l5.1 

RTA00000187AR.C.5.2 

90.D5.sp6:130899.Seq 

90.E5.sp6:130911.Seq 

RTA00000187AR.C.15.2 

90.F5.sp6:130923.Seq 

RTA000001 87AF.d. 1 5 . 1 

90.G5.sp6:130935.Seq 

RTA00000187AF.e.8.1 

90.H5.sp6:130947.Seq 

RTA00000187AF.f.l8.1 

RTA00000187AF.f.24.1 

90.A6.sp6:130864.Seq 
RTA00000187AF.f.24.1.Seq_THC220002 

RTA00000187AF.g.l2.1 

90.B6.sp6:130876.Seq 

219.F2.sp6:139035.Seq 

90.D6.sp6:130900.Seq 

RTA00000187AF.g.24.1 

RTAOOOOO 1 87AF.g.24. 1 .Seq_THC 1 68636 

90.E6.sp6:130912.Seq 

RTAOOOOO 187AF.h. 13.1 

RTA00000187AF.i.l.l 

90.A7.sp6:130865.Seq 

90.B7.sp6:130877.Seq 

RTA00000187AR.i.l0.2 

90.F7.sp6:130925.Seq 
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M00001680D.F08 10539 
M00001680D:F08 10539 
M00001682C.BI2 17055 
M00001682C:B12 17055 
M00001682C.B12 17055 
M00001688C:F09 5382 
M00001688C:F09 5382 
M00001693C:G01 4393 
M00001693C.G01 4393 
M00001716D:H05 67252 
M00001716D.H05 67252 
M00003741D.C09 40108 
M00003741D.C09 40108 
M00003747D:C05 1 1476 

M00003747D:C05 11476 
M00003747D:C05 1 1476 

"iuwwwj /h/l»:v^io 11476 
M00003754C:E09 
M00003754C:E09 
M00003761D:A09 
M00003761D:A09 
M00003761D:A09 
M00003762C.-B08 
M00003762C:B08 
M00003762C:B08 
M00003763A:F06 
M00003763A.F06 
M00003774C:A03 
M00003774C:A03 
M00003774C:A03 
M00003784D.D12 
M00003784D:D12 
M00003839A:D08 
M00003839A:D08 
M00003851B:D08 
M00003851B.D08 
M00003851B:D10 
M00003851B:D10 
M00003853A.D04 
M00003853A:D04 
M00003853A:F12 
M00003853A:F12 
M00003856B.-C02 
M00003856B:C02 
M00003857A:HO3 
M00003857A:H03 
M00003857A:H03 
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17076 

17076 

17076 

3108 

3108 

67907 

67907 

67907 



7798 
7798 



13595 

13595 

5619 

5619 

10515 

10515 

4622 

4622 

4718 

4718 

4718 



Sequence Name " 

219.F6.sp6:139039.Seq 

RTA00000187AF.1.7.1 

90.G7.sp6.130937.Seq 

RTA00000187AF.m.3.1 

176.D6.sp6:l 34553. Seq 

90.A8.sp6.i30866.Seq 

RTA00000187AF.m.23.2 

RTA00000187AF.n.l7.1 

90.B8.sp6:130878.Seq 

RTA00000187AF.O.6.1 

90.C8.sp6:130890.Seq 

90.D8.sp6:130902.Seq 

RTA00000187AF.O.24.1 

RTA00000187AF.p.l9.1 

90.E8.sp6:130914.Seq 

RTA000001 87AF.D. 1 9. 1 .Sen TUr 1 n»/i «n 

219.H8.sp6.139065.Seq *~ " 

90.F8.sp6:130926.Seq 

RTA00000188AF.b.l2.1 

RTA00000188AF.d.ll.l 

90.H8.sp6:130950.Seq 

RTA00000 1 88AF.d. 11.1 .Seq_THC2 1 2094 

RTA00000 1 88AF.d.2 1 . 1 .SeqTHC208760 

90.A9.sp6.130867.Seq 

RTA00000188AF.d.21.1 

RTA00000188AF.d.24.1 

90.B9.sp6:130879.Seq 

RTA000001 88AF.g.l 1 .1 .Seq_THC 123222 

RTA00000188AF.g.]].l 

90.C9.sp6: 130891. Seq 

RTAOOOOOI88AF.1.8.I 

90.D9.sp6: 130903 .Seq 

RTA00000189AF.C.18.1 

90.A10.sp6:130868.Seq 

90.D10.sp6:130904.Seq 

RTA00000189AF.f.7.1 

90.E10.sp6:130916.Seq 

RTA00000189AF.f.8.1 

90.F10.sp6:130928.Seq 

RTA00000189AF.f.l7.1 

90.G10.sp6:130940.Seq 

RTA00000189AF.f.l8.1 

90.H10.sp6:130952.Seq 

RTA00000189AF.g.l.l 

90.Bll.sp6: 130881. Seq 

RTA00000 1 89AF.g.5. 1 .SeqTHC 1 961 02 

RTA00000189AF.g.5.1 
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WO 99/33982 
cDNA Library ESI 5 - ATCC# 
Deposit Date - December 22, 1998 

Clone Name Cluster ID 
M00003867A:D10 
M00003867A:D10 

M00003871C:E02 4573 

M00003875C:G07 8479 

M00003875C:G07 8479 
M00003875D:D1 1 
M00003875D:D1 1 

M00003876D:E12 7798 

M00003876D:E12 7798 

M00003906C-.E10 9285 

M00003906C:E10 9285 

M00003907D:A09 39809 

M00003907D:A09 39809 

M00003907D:A09 39809 

M00003907D:H04 16317 

M00003907D:H04 16317 

M00003909D-.C03 8672 

M00003909D:C03 8672 

M00003968B-.F06 24488 

M00003968B:F06 24488 

M00003970C-.B09 40122 

Mftnnft-?970C:B09 40122 
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M00003970C:B09 


40122 


M00003974D:E07 


23210 


M00003974D:E07 


23210 


M00003974D-.E07 


23210 


M00003974D:H02 


23358 


M00003974D:H02 


23358 


M00003974D:H02 


23358 


M00003981A-.E10 


3430 


M00003981A:E10 


3430 


M00003982C-.C02 


2433 


M00003982C:C02 


2433 


M00003982C:C02 


2433 


M00004028D:C05 


40073 


M00004028D:C05 


40073 


M00004035C:A07 


37285 


M00004035C:A07 


37285 


M00004035D:B06 


17036 


M00004035D:B06 


17036 


M00004072A-.C03 




M00004072A:C03 


15069 


M00004081C:D10 


M00004081C:D10 


15069 


M00004086D-.G06 


9285 


M00004086D:G06 


9285 


M00004105C:A04 


7221 


M00004105C:A04 


7221 


M00004171D:B03 


4908 


M00004171D:B03 


4908 


M00004185C:C03 


11443 


M00004185C:C03 


11443 



Sequence Name 

90.C1 l.sp6:l 30893. Seq 

RTA00000189AF.h.l7.1 

RTA00000189AF.j.l2.1 

90.Gll.sp6: 130941. Seq 

RTA00000189AF.j.22.1 

90.Hll.sp6:130953.Seq 

RTA00000189AF.J.23.1 

90.A12.sp6:130870.Seq 

RTA00000189AF.k.l2.1 

90.H12.sp6:130954.Seq 

RTA00000190AF.d.7.1 

99.Al.sp6:131230.Seq 

RTA00000 1 90AF.e.3 . 1 .Seq_THC 1 502 1 7 

RTA00000190AF.e.3.1 

99.Bl.sp6:131242.Seq 

RTA00000190AF.e.6.1 

RTA00000190AF.f.ll.l 

99.Cl.sp6:131254.Seq 

RTA00000190AF.n.l6.1 

99.C2.sp6:131255.Seq 

RTA00000 1 90AF.n.23 . 1 
RTA00000190AF.n.23.1.Seq_THC109227 

99.D2.sp6:131267.Seq 

RTA00000190AF.O.20.1 

RTA00000190AF.o.20.1.Seq_THC207240 

99.E2.sp6:131279.Seq 

RTA000001 90AF.O.2 1 . 1 .Seq_THC207240 

RTA00000190AF.O.21.1 

99.F2.sp6:13 1291. Seq 

99.A3.sp6:131232.Seq 

RTA00000191AF.a.9.1 

RTA00000191AF.a.l5.2 

99.B3.sp6:131244.Seq 
RTA00000191AF.a.l5.2.Seq_THC79498 

RTA00000191AF.e.3.1 

99.E3.sp6:131280.Seq 

99.H3.sp6:131316.Seq 

RTA00000191AF.f.ll.l 

RTA00000191AF.f.l3.1 

99.A4.sp6:131233.Seq 

RTA00000191AFJ.9.1 

99.D4.sp6:131269.Seq 

99.F4.sp6:131293.Seq 

RTA00000191AF.1.6.1 

99.H4.sp6:131317.Seq 

RTA00000191AF.m.l8.1 

99.D5.sp6:131270.Seq 

RTA00000191AF.p.9.1 

RTA00000192AF.j.2.1 

99.F6.sp6:131295.Seq 

RTA00000192AF.1.13.2 

123.A8.sp6:132272.Seq 
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Cluster ID 
11443 



WO 99/33982 
cDNA Library ES 1 5 - ATCC# 
Deposit Date - December 2 2, 1998 
Clone Name 
M00004185C.C03 
M00004191D:B11 
M00004191D.B11 
M00004191D:B11 
M00004197D:H01 
M00004197D.-H01 
M00004197D:H01 
M00004203B:C12 
M00004203B.-C12 
M00004214C:H05 
M00004214C:H05 
M00004223D:E04 
M00004223D:E04 
M00004269D:D06 
M00004269D.D06 
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8210 
8210 
8210 
14311 
14311 
11451 
11451 
12971 
12971 
4905 
4905 



M00004295D:F12 16921 
muuuu^yDu:r u 16921 
M00004296C.H07 13046 
M00004296C:H07 13046 
M00004307C:A06 9457 
M00004307C:A06 9457 
M00004307C:A06 9457 

M00004312A:G03 26295 

M000043 1 2A:G03 26295 

M00004312A:G03 26295 

M00004318C:D10 21847 

M00004318C:D10 21847 

M00004359B:G02 

M00004359B.G02 

M00004505D:F08 

M00004505D:F08 

M00004692A:H08 

M00004692A:H08 

M00004692A:H08 

M00005180C.G03 



Sequence Name ' 

99.A7.sp6:131236.Seq 

RTA00000192AF.m.l2.1 

99.B7.sp6:] 3 1248.Seq 

123.C8.sp6:132296.Seq 

99.C7.sp6:131260.Seq 

123.E8.sp6:132320.Seq 

RTA00000192AF.n.l3.1 

99.D7.sp6:131272.Seq 

RTA00000192AF.O.2.1 

177.D8.sp6:134747.Seq 

RTA00000192AF.p.l7 1 

RTA00000193AF.a.20.1 

99.B8.sp6:131249.Seq 

99.H8.sp6.131321.Seq 

RTA00000193AF.e.l4.1 

99.D9.sp6:131274.Sea 

RTA00000193AF.h.l5.1 

99.E9.sp6:131286.Seq 

RTA00000193AF.h.l9.1 

RTA00000193AF.i.l4.2 

99.F9.sp6:131298.Seq 

123.Dll.sp6:132311.Seq 

RTA00000193AF.i.24 2 

99.G9.sp6.131310.Seq 

RTA000001 93 AF.i.24.2.Seq_THC 1 97345 

RTA00000193AF.j.9.1 

99.H9.sp6:131322.Seq 

RTA000001 93 AF.m.5. 1 .SeqTHC 1 733 1 8 

RTA00000193AF.m.5.1 

RTA00000194AF.b.l9.1 

99.H10.sp6: 13 1323. Seq 

99.Bll.sp6:131252.Seq 

RTA00000194AF.C.24.1 

377.F4.sp6:141957.Seq 

RTA00000194AF.f.4.1 
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cDNA Library ES 1 6 - ATCC# 
Deposit Date - December 22, 1998 

Clone Name Cluster ID 

M00001346D:E03 6806 
M00001350A-.B08 
M00001350A:B08 

M00001357D:D11 4059 

M00001357D-.D11 4059 

M00001409C.D12 9577 

M00001409C:D12 9577 

M00001418B:F03 9952 

M00001418B:F03 9952 

M00001418B:F03 9952 

M00001418D-.B06 8526 

M00001421C:F01 9577 

M00001421C:F01 9577 

M00001429B:A11 4635 
M00001432C-.F06 

M00001439C:F08 40054 

M00001442C:D07 16731 

M00001442C:D07 16731 
M00001443B:F01 
M00001443B:F01 

M00001445A:F05 13532 

M00001445A:F05 13532 

M00001446A:F05 7801 

M00001455A:E09 13238 

M00001455A:E09 13238 

M00001460A:F12 39498 

M00001481D:A05 7985 

M00001490B:C04 18699 

M0000 1 490B:C04 1 8699 

M00001500C:E04 9443 

M00001500C:E04 9443 

M00001532B-.A06 3990 

M00001532B:A06 3990 

M00001534A:F09 5321 

M00001534A:F09 5321 

M00001535A:B01 7665 

M00001536A:C08 39392 

M00001536A:C08 39392 

M00001541A:F07 22085 
M00001542B:B01 
M00001542B:B01 

M00001544A:E03 12170 

M00001545A:C03 19255 

M00001545A:C03 19255 

M00001545A:C03 19255 

M00001548A:H09 1058 

M00001548A:H09 1058 

M00001548A:H09 1058 

M00001549A-.B02 4015 

M00001549A.B02 4015 

MO000 1 549A:D08 1 0944 

M00001552B:D04 5708 
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Sequence Name 

RTA00000177AF.g.l3.3 

80.H2.sp6:l 30293 .Seq 

RTA00000177AF.i.6.2 

RTA00000 1 77AF.n. 1 8.3 .Seq_THC 1 2305 1 

RTA00000177AF.n.l8.3 

RTA00000179AF.O.17.1 

80.E7.sp6:130262.Seq 

RTA00000180AF.C.20.1 

RTA00000 1 80AF.C.20. 1 .Seq_THC 1 62284 

80.E8.sp6: 130263. Seq 

RTA00000180AF.d.l.l 

RTA00000180AF.d.23.1 

80.G8.sp6:130287.Seq 

RTA000001 80AF.i.20.1 

RTA000001 80AF.k.24. 1 

RTA00000180AF.p.l0.1 

RTA00000181AF.a.20.1 

80.C10.sp6:130241.Seq 

80.D10.sp6:130253.Seq 

RTA00000181AF.b.7.1 

80.E10.sp6:130265.Seq 
RTA00000181AF.C.4.1 

RTA00000181AF.C.21.1 

RTA00000181AF.m.4.1 

RTA000001 8 1 AF.m.4. 1 .Seq_THC 1 4069 1 

RTA00000119A.j.20.1 

RTAOOOOOl 82AR.J.2.1 

RTAOOOOOl 82AF.m. 16.1 

89.D3.sp6:130705.Seq 

89.B4.sp6:130682.Seq 

RTA00000183AF.C.1.1 

89.G6.sp6:130744.Seq 

RTAOOOOOl 83 AF.j. 11.1 

89.B7.sp6:130685.Seq 

RTAOOOOOl 83 AF.k.8.1 

RTA00000134A.1.19.1 

89.G7.sp6:130745.Seq 

RTA00000134A.m.l6.1 

RTA00000135A.e.5.2 

RTAOOOOOl 83AF.p.4.1 

89.F8.sp6:130734.Seq 

RTA00000125A.h.l8.4 

RTA00000135A.m.l8.1 

184.B10.sp6:135547.Seq 

89.C9.sp6:130699.Seq 

RTAOOOOO 126A.e.20.3.Seq_THC2 1 7534 

RTAOOOOOl 26A.e.20.3 
79.F6.sp6: 130081. Seq 
RTA00000136A.e.l2.1 
79.G6.sp6: 130093. Seq 
RTAOOOOO 126A.h. 17.2 
RTA00000184AF.g.l2.1 
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Clone Name 
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M00001552B:D04 
M00001552D:A01 
M00001552D.-A01 
M00001553D:D10 
M00001553D.D10 
M00001558A:H05 
M0OOO1558A:H05 



Cluster ID 
5708 



22814 
22814 



MU0U0156I A:C05 


39486 


M0000 1 56 1 A;C05 


39486 


M00001564A:B12 


5053 


M00001578B:E04 


23001 


iviuuuui j /yU:C03 


6539 


M00001579D:C03 


6539 


M00001579D:C03 


6539 


M00001582D:F05 




MOOO01587A.B11 


39380 


muuuui36/A:tJJ J 


39380 


M00001604A.F05 


39391 


M00001604A:F05 


39391 


M00001624A:B06 


3277 


M00001624A:B06 


3277 


M00001624A.-B06 


3277 


M00001630B:H09 


5214 


M00001630B:H09 


5214 


M00001630B:H09 


5214 



M00001651A:H01 




M00001651A.H01 




M00001677C:E10 


14627 


M00001679C:F01 


78091 


M00001679C.F01 


78091 


M00001679C:F01 


78091 


M00001686A.E06 


4622 


M00003796C:D05 


5619 


M00003796C.D05 


5619 


M00003826B.A06 


11350 


M00003826B:A06 


11350 


M00003833A:E05 


21877 


M00003837D:A01 


7899 


M00003837D.A01 


7899 


M00003846B:D06 


6874 


M00003846B:D06 


6874 


M00003879B:D10 


31587 


M00003879B.D10 


31587 


M00003879D:A02 


14507 


M00003879D.A02 


14507 


M00003891C:H09 




M00003891C.H09 




M00003912B:D01 


12532 


M00003912B:D01 


12532 


M00004072B:B05 


17036 


M00004081C:D12 


14391 


M00004111D:A08 


6874 



Sequence Name 

89.E10.sp6.130724.Seq 

89.F10.sp6:130736.Seq 

RTA00000184AF.g.22.1 

RTA00000184AF.h.l4.1 

89.All.sp6.130677.Seq 

RTA00000128A.C.20.1 

89. F12.sp6.130738.Seq 
RTA00000128A.m.22.2 
79.B8.sp6:130035.Seq 
RTA00000184AF.O.12.1 
RTA00000185AF.C.24.1 

90. Gl.sp6:130931.Seq 

173.A12.SP6:134080.Seq 

RTA00000185AF.d.ll.l 

RTA00000185AF.d.24.1 

RTA00000129A.e.24.1 

79.E8.sp6:130071.Seq 

RTA00000138A.C.3.1 

79.A9.sp6:130024.Seq 

RTA00000138A.1.5.1 
217.El.sp6:139406.Seq 
90.B4.sp6:130874.Seq 
90.D4.sp6:130898.Seq 

122.C2.sp6:132098.Seq 

RTA00000186AF.g.ll.l 

RTA00000186AF.n.7.1 

90.A5.sp6:130863.Seq 

RTA00000187AF.g.23.1 

90.C7.sp6:130889.Seq 

RTA00000187AF.j.6.1 

176.G5.sp6.i34588.Seq 

RTA00000187AF.m.l5.2 

RTA000001 88AF.1.9. 1 .Seq_THC 1 6784 5 

RTA00000188AF.1.9.1 

RTA000001 89AF.a.24.2 
90.F9.sp6:130927.Seq 
RTA00000189AF.b.21.1 
90.H9.sp6.130951.Seq 
RTA00000189AF.C.10.1 
RTA00000189AF.e.9.1 
90.C10.sp6:130892.Seq 
RTA00000189AF.1.20.1 
90.C12.sp6:130894.Seq 
90.D12.sp6:130906.Seq 
RTA00000189AR.1.23.2 
90.G12.sp6:130942.Seq 
RTA00000189AF.p.8.1 
99.Dl.sp6:131266.Seq 
RTA00000190AF.g.2.1 
RTA00000191AFJ.10.1 
RTA00000191AF.1.7.1 
RTA00000192AF.a.l4.1 
138 
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WO 99/33982 
cDNA Library ESI 6 - ATCC# 
Deposit Date - December 22, 1998 
Clone Name 
M00004111D:A08 
M00004121B:G01 
M00004121B:G01 
M00004121B:G01 
M00004138B:H02 
M00004138B:H02 
M00004151D:B08 
M00004169C:C12 
M00004169C:C12 
M00004169C:C12 
M00004183C:D07 

M00004183C:D07 

M00004230B:C07 

M00004230B-.C07 

M00004249D:F10 

M00004249D:F10 

M00004275C:C11 

M00004275C-.C1 1 

M00004283B:A04 

M00004285B-.E08 

M00004327B:H04 

M00004377C:F05 

M00004384C:D02 

M00004384C.D02 

M00004461A:B08 

M00004461A-.B09 

M00004691D:A05 
M00004896A:C07 



PCT/US98/27610 



10 



Cluster ID 
6874 



13272 

13272 

16977 

5319 

5319 

5319 

16392 

16392 

7212 

7212 



16914 
16914 
14286 
56020 



2102 



Sequence Name 

99.F5.sp6:131294.Seq 

177.H4.sp6:l 34791. Seq 

99.H5.sp6:131318.Seq 

RTA00000192AF.C.2.1 

99.A6.sp6:131235.Seq 

RTA00000192AF.e.3.1 

RTA00000192AF.g.3.1 

99.E6.sp6:131283.Seq 

RTA00000192AF.i.l2.1 

123.F7.sp6:132331.Seq 

RTA00000192AF.1.1.1 
RTA00000192AF.l.l.l.Seq_THC202071 

RTA00000193AF.b.l4.1 

99.D8.sp6:131273.Seq 

RTA00000193AF.C.21 .1 .Seq_THC222602 

RTA00000193AF.C.21.1 

99.A9.sp6:131238.Seq 

RTA00000193AF.f.5.1 

RTA00000193AF.f.22.1 

RTA00000193AF.g.2.1 

RTA00000193AF.j.20.1 

RTA00000193AF.n.7.1 

RTA00000193AF.n.l5.1 

RTA00000193AF.n.l5.1.Seq_THC215687 

RTA00000 1 94AR.a. 1 0.2 
RTA00000194AF.a.ll.l 
RTA00000194AF.C.23.1 
RTA00000194AF.d.l3.1 



The above material has been deposited with the American Type Culture Collection, 
Rockville, Maryland, under the accession number indicated. This deposit will be maintained 
under the terms of the Budapest Treaty on the International Recognition of the Deposit of 
Microorganisms for purposes of Patent Procedure. The deposit will be maintained for a 
period of 30 years following issuance of this patent, or for the enforceable life of the patent, 
whichever is greater. Upon issuance of the patent, the deposit will be available to the public 
from the ATCC without restriction. 

This deposit is provided merely as convenience to those of skill in the art, and is not 
an admission that a deposit is required under 35 U.S.C. §112. The sequence of the 
polynucleotides contained within the deposited material, as well as the amino acid sequence 
of the polypeptides encoded thereby, are incorporated herein by reference and are controlling 
in the event of any conflict with the written description of sequences herein. A license may 
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WO 99/33982 

be reauired ,o make, use, or seU the deposited materia,, and no such l^l^Zei 
hereby. 



10 



15 



Retrieval of Individual Glnn es from Dep n.it of PooIpH r.»n»c 

Where the ATCC deposit is composed of a pool of cDNA clones, the deposit was 
prepared by first transfecting each of the clones into separate bacterial cells. The clones 
were then deposited as a pool of equal mixtures in the composite deposit. Particular clones 
can be obtained from the composite deposit using methods well known in the art For 
example, a bacterial cell containing a particular clone can be identified by isolating single 
colomes, and identifying colonies containing the specific clone through standard colony 
hybnd 12 atu>n techniques, using an oligonucleotide probe or probes designed to specifically 

hybridize to a sequence of the clone insert (e.s.. a orobe h^A , m , 

the encoded polynucleotide having the indicated SEQ ID NO). The probe should be 
designed to have a T m of approximately 80°C (assuming 2°C for each A or T and 4°C for 
each G or C). Positive colonies can then be picked, grown in culture, and the recombinant 
clone isolated. Alternatively, probes designed in this manner can be used to PGR to isolate a 
nuclei acid molecule from the pooled clones according to methods well known in the art 
e.g, by purifying the cDNA from the deposited culture pool, and using the probes in PGR 
reasons to produce an amplified product having the corresponding desired polynucleotide 
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WO 99/33982 
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Table 1. Sequence identification numbers, cluster ID, sequence name, 
SEQIDNO: Cluster ID Sequence Name 



1 


4635 


T* T" A AAAAA 1 °A AP ! 90. 1 

JK 1 AUUUUU 1 ou/\r a.a\j. i 


2 




RTA000UU 1 oW .n. i z. l 


3 


4622 


x% t* A AAAAA1 QH A rrt 1 *\ 9 

RTAUUUUU1 o /Ar .m. 1 j.z 


4 


3706 


nTA AAAAA1Q1 A 17 i 19 9 

RT AUUUUU 1 9 1 Ar .1 . 1 / .z 


5 


36535 


n T A AAAAA 1 81 AF f ^ 1 

R 1 AUUUUU 1 0 1 /\r .1. j . 1 


6 


3990 


Rl AUUUUU 1 oiAr .J. 1 1 . 1 


7 


5319 


RTA00000 1 92 Ar . 1. 1 2 . l 


8 


36393 


RTA00000 1 80Ar .c.2. I 


9 


2623 


txt^ a aaaaai OO A T >-k /C 1 

RTAOOOOO 1 83 Ar .a.o. l 


10 


7587 


RTAOOOOO 1 78Ar .n.24. 1 


11 


7065 


RTAOOOOO 1 37 A.g.o. 1 


12 


10539 


RTAOOOOO 1 87 Ar. 1.7.1 


13 


27250 


RTAOOOOO 1 8 1 AF.g. 1 0. 1 


14 


5556 


RTAOOOOO 1 79 AF.n. 10.1 


15 




RTAOOOOO 1 92 AF.m. 1 2. 1 


16 


8761 


RTAOOOOO 1 84AF.k. 1 2. 1 


17 


4622 


RTAOOOOO 1 89 AF.g. 1 .1 


18 


11460 


RTAOOOOO 1 87 AF.g. 12.1 


19 


16283 


RTA00000 1 20A.O.20. 1 


20 


3430 


RTA00000 191 Ar .a.9. 1 


21 


7065 


RTAOOOOO 1 84 AF.j .21 . 1 


22 




•nm A A A A A A 1 A T7 1 OA 1 

RTA00000 1 82 AF . 1 .20 . 1 


23 




nrp A A A A/\A 1 Ol A » 1 Q 1 

RTAOOOOO 1 23 A.g. 19.1 


24 


16918 


rr^ a A. A A A A t AO A L T ^ 1 1 

RTAOOOOO 1 93 Ar .a. 1 o. l 


25 


16914 


t» t 1 a a a a aa 1 O'J A IT -f *C 1 

RTA000UU 1 93 Ar .I.D . 1 


26 


40108 


t-v t> a AAAAA1 0*7 A IT ax *") /I 1 

RTAOOOOO 1 87 Ar .0.24. l 


27 


14286 


T%*T» A AA AAA 1 C\*5 A t? -f OO 1 

RTAOOOOO 1 93 Ar .1.22. 1 


28 


17004 


TITA AAAAA1 Q^AP K 91 1 

RTAOUUUU 1 ooAr .D.z i . i 


29 




R 1 AUUUUU i oU/\r i 


30 


13272 


nTAAAAAniQ9 AF p 'J 1 

R 1 AUUUUU i yz/\r .e. j . 1 


31 




T5T 1 A AAAAA1 GAAP f d 1 

K 1 AUUUUU 1 7*t/vr a.H.i 


32 


32663 


DTAft A ftftH1 16A I fi | 

K 1 AUUUUU 1 1 o/\.i.o. i 


33 




DTAnnoftftl R0AF a 9 1 


34 


5832 


RTA00000178AF.O.23.1 


35 


7801 


RTA000001 8 1 AF.c.2 1 . 1 


36 


76760 


RTA00000187AF.a.l5.1 


37 


40132 


RTA00000178AF.C.7.1 


38 




RTA00000183AF.e.l.l 


39 


4016 


RTA00000118A.C.4.1 


40 


5382 


RTA00000187AF.m.23. 



and clone name 
Clone Name 
M00001429B:A1 1 
M00001608D:A11 
M00001686A:E06 
M00004068B-.A01 
M00001449A:G10 
M00001532B-.A06 
M00004169C:C12 
M00001417A:E02 
M00001497A:G02 
M00001387B:G03 
M00001557A:D02 
M00001680D:F08 
M00001450A:D08 
M00001407B:D11 
M00004191D:B11 
M00001557D:D09 
M00003856B-.C02 
M00001676B:F05 
M00001467A:D08 
M00003981A:E10 
M00001557A:D02 
M00001488B:F12 
M00001531A:H11 
M00004223A:G10 
M00004275C:C1 1 
M00003741D:C09 
M00004283B-.A04 
M00001617C:E02 
M00001426B:D12 
M00004138B.H02 
M00005180C:G03 
M00001450A:A11 
M00001414A:B01 
M00001388D:G05 
M00001446A:F05 
M00001657D:F08 
M00001365C-.C10 
M00001505C:C05 
M00001395A:C03 
M00001688C.F09 
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WO 99/33982 
SEQIDNO: Cluster 

41 5693 

42 307 

43 39833 
44 

45 5325 

46 5325 

47 18957 

48 39508 

49 22390 

50 12170 

51 4393 

52 19 
53 

54 7899 

55 40073 

56 7005 
57 

58 18957 

59 J8957 

60 16283 

61 6974 

62 2623 

63 9105 

64 13238 

65 5749 

66 6455 

67 23001 

68 6455 

69 13595 

70 39442 

71 17036 
72 

73 7005 

74 6268 

75 16130 

76 23201 

77 5321 

78 13157 

79 2102 

80 1058 

81 40392 
82 

83 11476 



ID Sequence Name 

RTA00000190AF.p.l7.2 

RTA00000136A.O.4.2 

RTA00000178AF.i.23.1 

RTA00000193AF.m.5.1 

RTA00000191AF.O.6.1 

RTA00000191AF.O.6.2 

RTA00000190AR.m.9.1 

RTA00000120A.O.2.1 

RTA00000136A.j.l3.1 

RTA00000125A.h.l8.4 

RTA00000187AF.n.l7.1 

RTA00000182AF.b.7.1 

RTA00000193AF.C.21.1 

RTA00000189AF.C.10.1 

BTAflnftnnim a t? . t 1 
>uuuwu i j .CO. 1 

RTA00000179AF.O.22.1 

RTA00000187AF.h.22.1 

RTA00000190AF.m.9.2 

RTA00000183AF.h.23.1 

RTA00000182AF.C.22.1 

RTA00000183AF.d.9.1 

RTA00000183AF.b.l4.1 

RTA00000191AF.a.21.2 

RTA00000181AF.m.4.1 

RTA00000185AF.a.l9.1 

RTA00000193AF.b.9.1 

RTA00000185AF.C.24.1 

RTA00000192AF.g.23.1 

RTA00000189AF.f.8.1 

RTA00000120A.O.21.1 

RTA00000191AF.f.l3.1 

RTA00000183AF.g.9.1 

RTA00000181AF.k.24.1 

RTA00000126A.O.23.1 

RTA00000119A.C.13.1 

RTA00000187AF.a.l4.1 

RTA00000183AF.k.8.1 

RTA00000186AF.a.6.I 

RTA00000193AF.n.7.1 

RTA00000126A.e.20.3 

RTA00000180AF.j.8.1 

RTA00000183AF.e.23.1 

RTA00000187AF.p.l9.1 
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PCT/US98/27610 
Clone Name 

M00003978B.G05 

M00001552A.B12 

M00001378B:B02 

M00004359B:G02 

M00004093D:B12 

M00004093D:B12 

M00003958A:H02 

M00001467A.D04 

M00001551A:G06 

M00001544A:E03 

M00001693C:G01 

M00001463C:B11 

M00004249D:F10 

M00003837D:A01 

M00004028D:C05 

M00001410A:D07 

M00001679A:F06 

M00003958A.H02 

M00001528A:F09 

M00001467A:D08 

M00001504C:H06 

M00001500A:E11 

M00003983A:A05 

M00001455A:E09 

M00001571C.H06 

M00004229B:F08 

M0000I578B:E04 

M00004157C:A09 

M00003851B.D10 

M00001467A:E10 

M00004035D:B06 

M00001513B:G03 

M00001454B:C12 

M00001551A.B10 

M00001453A.E11 

M00001657D:C03 

M00001534A.F09 

M0000I614C.F10 

M00004377C.F05 

M00001548A:H09 

M00001429D:D07 

M00001506D.A09 

M00003747D:C05 



BNSDOCID: <WO 9933982A2J_> 



WO 99/33982 
SEQ ID NO: Cluster ID 

84 3584 

85 10470 

86 39425 

87 5175 

88 13576 

89 7665 

90 16927 

91 6660 

92 2433 

93 5097 

94 21847 

95 3277 

96 5708 

97 945 

98 16269 
99 

100 16731 

101 12439 

102 3162 
103 

104 
105 

106 10717 

107 4573 
108 

109 H443 

110 5892 

111 3162 

112 10470 

113 17055 

114 2030 

115 6558 

116 23255 

117 9577 
118 

119 8 

120 67907 

121 12081 

122 2448 

123 3389 

124 39174 

125 24488 

126 8210 



Sequence Name 

RTA00000177AF.h.20.1 

RTA00000180AF.f.l8.1 

RTA00000133A.f.l.l 

RTA00000184AF.f.3.1 

RTA00000 1 89AF.0. 13.1 

RTA00000134A.1.19.1 

RTA00000177AF.h.9.3 

RTA00000187AF.h.l5.1 

RTA00000191AF.a.l5.2 

RTA00000134A.k.l.l 

RTA00000193AFJ.9.1 

RTA00000138A.1.5.1 

RTA00000184AF.g.l2.1 

RTA00000 1 78 AR.a.20 . 1 

RTA00000178AF.p.l.l 

RTA00OO0 1 83 AF.c.24. 1 

RTA00000181AF.a.20.1 

RTAOOOOO 1 90AF.O.24. 1 

RTA00000177AF.j.l2.3 

RTA000001 94AF.b. 19. 1 

RTA00000193AF.n.l5.1 

RTAOOOOO 186AF.n.7.1 

RTA00000181AF.d.l0.1 

RTA00000189AFJ.12.1 

RTA000001 86AF.h.l4.1 

RTA00000192AF.1.13.2 

RTAOOOOO 184AF.d. 11.1 

RTA00000177AF.j.l2.1 

RTA00000185AF.k.6.1 

RTAOOOOO 1 87 AF.m.3.1 

RTA000001 93 AF.ra.20. 1 

RTA000001 84AF.m.21 .1 

RTA00000190AFJ.4.1 

RTAOOOOO 1 79 AF .o. 1 7. 1 

RTA00000180AF.a.l 1.1 

RTA00000181AF.e.l7.1 

RTA00000188AF.g.ll.l 

RTA00000133A.d.l4.2 

RTA00000119A.j.21.1 

RTAOOOOO 1 89 AF.g.3.1 

RTA00000124A.n.l3.1 

RTAOOOOO 1 90 AF .n. 1 6. 1 

RTA00000192AF.n.l3.1 
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Clone Name 

M00001349B:B08 

M00001424B.G09 

M00001470A:C04 

M00001550A:G01 

M00003885C-.A02 

M00001535A:B01 

M00001348B:B04 

M00001679A-.A06 

M00003982C:C02 

M00001534A:D09 

M00004318C:D10 

M00001624A:B06 

M00001552B:D04 

M00001362C:H11 

M000O1389A:C08 

M00001504A:E01 

M00001442C-.D07 

M00003975A:G11 

MOO0O1351B:A08 

M00004505D:F08 

M00004384C:D02 

M00001651A:H01 

M00001447A:G03 

M00003871C:E02 

M00001632D:HO7 

M00004185C:C03 

M00001548A:E10 

M00001351B.A08 

M00001597D:C05 

M00001682C:B12 

M00004372A:A03 

M00001560D:F10 

M00003922A:E06 

M00001409C:D12 

M00001414C:A07 

M00001448D:C09 

M00003774C:A03 

M00001469A:C10 

M00001460A-.F06 

M00003857A-.G10 

M00001541A:H03 

M00003968B:F06 

M00004197D:H01 



WO 99/33982 
SEQ ID NO: Cluster 

127 



128 

129 

130 

131 

132 

133 

134 

135 

136 

137 

138 

139 

140 

1 A 1 
iii 

142 

143 

144 

145 

146 

147 

148 

149 

150 

151 

152 

153 

154 

155 

156 

157 

158 

159 

160 

161 

162 

163 

164 

165 

166 

167 

168 

169 



40455 

9577 

13183 

5214 

67252 

3108 

2464 

36313 

23255 

7985 

8286 

22195 

4573 

Z.OO/J 

7187 

86859 

4623 

10185 

15069 

39304 

8672 

13576 

6293 

16977 

5345 

4905 
17036 

5417 

7172 

40044 

4386 

40044 

9685 

22155 

10515 

6539 

15066 

4261 

13864 

6539 



ID Sequence Name 

RTA00000135A.1.2.2 

RTA00000190AF.m.l0.2 

RTA00000180AF.d.23.1 

RTA00000192AF.a.24.1 

RTA00000186AF.g.ll.l 

RTA00000187AF.O.6.1 

RTA00000188AF.d.24.1 

RTA00000178AF.n.l8.1 

RTA00000181AF.e.23.1 

RTA00000177AF.e.l4.3 

RTA00000182AR.j.2.1 

RTA00000183AF.O.1.1 

RTA00000180AF.g.7.1 

RTA00000184AF.h.9.1 

RTA00000187AF.i.l.l 

RTA00000177AF.L8.2 

RTA00000118A.p.8.1 

RTA00000185AF.f.4.1 

RTA00000121A.C.10.1 

RTA00000183AF.d.5.1 

RTA00000183AF.p.4.1 

RTA00000191AF.1.6.1 

RTA00000118A.j.21.1 

RTA00000190AF.f.ll.l 

RTA00000177AF.g.l6.1 

RTA00000185AF.e.ll.l 

RTA00000192AF.g.3.1 

RTA00000189AF.1.19.1 

RTA00000193AF.e.l4.1 

RTA00000191AF.j.l0.1 

RTA00000191AF.h.l9.1 

RTA00000178AF.f.9.1 

RTA00000186AF.d.l.l 

RTA00000184AF.j.4.1 

RTA00000183AF.g.22.1 

RTA00000183AF.C.11.1 

RTA00000185AF.n.9.1 

RTA00000189AF.f.l8.1 

RTA00000185AF.d.ll.l 

RTA00000180AF.e.24.1 

RTA00000180AF.h.5.1 

RTA00000125A.m.9.1 

RTA00000189AF.d.22.1 
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PCT/US98/27610 
Clone Name 

M00001545A:B02 

M00003958C:G10 

M00001421C:F01 

M00004114C:F11 

M0000I630B:H09 

M00001716D:H05 

M00003763A:F06 

M00001387A:C05 

M00001448D:H01 

M00001343D:H07 

M00001481D:A05 

M00001540A:D06 

M00001425B:H08 

M00001553B:F12 

M00001679A:F10 

M00001350A:H01 

M00001452A:B12 

M00001586C:C05 

M00001469A:A01 

M00001504C.A07 

M00001542B.B01 

M00004081C.D10 

M00001450A.A02 

M00003909D:C03 

M00001347A:B10 

M00001583D.A10 

M00004151D:B08 

M00003879B:C11 

M00004269D:D06 

M00004072B:B05 

M00004059A:D06 

M00001371C:E09 

M00001621C:C08 

M00001556B:C08 

M00001514C:D11 

M00001501D:C02 

M00001608B:E03 

M00003853A:F12 

M00001579D:C03 

M00001423B.E07 

M00001426D:C08 

M00001545A.-D08 

M00003844C.B11 



BNSDOCID: <WO_9933982A2_l_> 



WO 99/33982 
SEQIDNO: Cluster ID 

170 H465 

171 3266 

172 102 

173 16970 

174 12971 

175 5007 

176 3765 

177 11294 

178 3681 

179 9283 

180 18699 

181 86110 

182 39648 

183 7337 

184 1334 

185 17076 

186 22794 

187 39171 

188 8551 

189 5857 

190 9443 

191 9457 

192 7206 

193 22979 

194 40455 

195 7221 
196 

197 7239 

198 31587 

199 16317 

200 13576 

201 5779 

202 6124 

203 9952 
204 

205 5779 

206 39490 

207 4416 

208 4009 

209 5336 

210 39186 

211 40122 

212 12532 



Sequence Name 

RTA00000185AF.m. 19.1 

RTA00000184AR.g.l-l 

RTA00000184AF.O.5.1 

RTA00000181AR.i.l8.2 

RTA000001 93 AF.a.20. 1 

RTA00000177AF.g.2.1 

RTA00000135A.d.l.l 

RTA00000184AFJ.6.1 

RTA00000131A.g.l5.2 

RTA00000 1 8 1 AR.m.2 1 .2 

RTAO0000 1 82 AF.m. 16.1 

RTA00000181AF.f.l2.1 

RTA00000178AR.1.8.2 

RTA00000123A.b.l7.1 

RTA00000178AF.j.7.1 

RTA00000188AF.d.21.1 

RTA00000138A.b.5.1 

RTA00000186AF.1.7.1 

RTA00000179AF.p.21.1 

RTA00000118A.g.l4.1 

RTA00000183AF.C.1.1 

RTA00000193AF.U4.2 

RTA00000182AF.O.15.1 

RTA00O00 1 7 8AF.k.22. 1 

RTA00000 1 90AR.m. 1 0. 1 

RTA00000191AF.p.9.1 

RTA00000191AF.j.9.1 

RTA00000126A.m.4.2 

RTA00000189AF.1.20.1 

RTA00000190AF.e.6.1 

RTAO0O00 1 89 AR.o. 13.1 

RTA00000177AF.g.l4.3 

RTA00000191AR.e.2.3 

RTA00000180AF.C.20.1 

RTA00000188AF.L8.1 

RTA00000177AF.g.l4.1 

RTA00000128A.b.4.1 

RTA00000187AF.h.l3.1 

RTA00000 1 79AF.e.20. 1 

RTA00000183AF.b.l3.1 

RTA00000121A.p.l5.1 

RTA00000 1 90 AF.n.23 . 1 

RTA00000 190AF.g.2. 1 

145 



PCT/US98/27610 
Clone Name 
M00001607A:E11 
M00001551C:G09 
M00001563B:F06 
M00001452C:B06 
M00004223D:E04 
M00001346A:F09 
M00001541A:D02 
M00001556B:G02 
M00001449A:D12 
M00001455D:F09 
M00001490B:C04 
M00001449C:D06 
M00001383A:C03 
M00001528A:C04 
M00001379A:A05 
M00003762C-.B08 
M00001601A-.D08 
M00001644C:B07 
M00001412B:B10 
M00001449A:A12 
M00001500C:E04 
M00004307C:A06 
M00001494D-.F06 
M00001382C:A02 
M00003958C:G10 
M00004105C:A04 
M00004072A:C03 
M00001550A-.A03 
M00003879B:D10 
M00003907D:H04 
M00003885C:A02 
M00001346D:G06 
M00004028D:A06 
M00001418B:F03 
M00003784D:D12 
M00001346D-.G06 
M00001557A:F03 
M00001678D:F12 
M00001396A:C03 
M00001500A:C05 
M00001512A-.A09 
M00003970C:B09 
M00003912B:D01 



WO 99/33982 
SEQ ID NO: Cluster 



213 

214 

215 

216 

217 

218 

219 

220 

221 

222 

223 

224 

225 

226 

227 

228 

229 

230 

231 

232 

233 

234 

235 

236 

237 

238 

239 

240 

241 

242 

243 

244 

245 

246 

247 

248 

249 

250 

251 

252 

253 

254 

255 



8078 

3900 

7589 

8298 

4443 

26295 

3389 

7015 

8526 

4665 

1399 

9244 

82498 

1C-7/V> 

■J -J I \J£. 

38759 

39648 

19105 

85064 

9285 

9285 

39391 

39498 
7798 
7798 
19829 

4275 

22113 

40314 

10944 

39809 

22085 

19255 

14311 

8479 

4193 

22814 

39563 

39420 

11589 



H> Sequence Name 

RTA00000177AR.1.13.1 

RTA00000190AF.g.l3.1 

RTA00000120A.p.23.1 

RTA00000127A.d.l9.1 

RTA00000177AF.b.20.4 

RTA00000193AF.i.24.2 

RTA00000183AF.m.l9.1 

RTA00000187AF.f.l8.1 

RTA00000180AF.d.l.l 

RTA00000186AF.m.3.1 

RTA00000129A.O.10.1 

RTA00000127A.1.3.1 

RTA00000179AF.j.l3.1 

RTAb00001I8A.m.l0.1 

RTA00000 i 87AR.C. 1 5.2 

RTA00000120A.m.l2.3 

RTA00000178AF.1.8.1 

RTA00000133A.e.l5.1 

RTAO0OOOI31A.m.23.1 

RTA00000191AF.m.l8.1 

RTA00000190AF.d.7.1 

RTA00000138A.C.3.1 

RTA00000178AF.d.20.1 

RTA00000119A.j.20.1 

RTA00000189AF.k.l2.1 

RTA00000189AF.C.18.1 

RTA00000125A.h.24.4 

RTA00000188AF.d.ll.l 

RTA00000120A.j.l4.1 

RTA00000125A.C.7.1 

RTA00000186AF.C.15.1 

RTA00000126A.h.l7.2 

RTA00000190AF.e.3.1 

RTA00000135A.e.5.2 

RTA00000135A.m.l8.1 

RTA00000192AF.O.2.1 

RTA00000189AF.j.22.1 

RTA00000189AF.j.23.1 

RTA00000184AF.e.l3.1 

RTA00000184AF.h.l4.1 

RTA00000179AF.k.20.1 

RTA00000134A.O.23.1 

RTA00000I77AF.b.l7.4 
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Clone Name 

M00001353A:G12 

M00003914C:F05 

M00001468A:F05 

M00001553A.H06 

M00001341A:E12 

M00004312A:G03 

M0O001537B:GO7 

M00001673C:H02 

M00001418D:B06 

M00001648C:A01 

M00001604A:B10 

M00001556A:C09 

M00001400B.H06 

M00001450A.B12 

M00001663A.E04 

M00001467A.B07 

M00001383A:C03 

M00001469A:H12 

M00001452A.F05 

M00004086D.G06 

M00003906C:E10 

M00001604A:F05 

M00001368D:E03 

M00001460A.F1 2 

M00003876D.-E12 

M00003839A:D08 

M00001544A:G02 

M00003761D:A09 

M00001466A:E07 

M00001542A:A09 

M00001619C:F12 

M00001549A.D08 

M00003907D.A09 

M00001541A:F07 

M00001545A:C03 

M00004203B:C12 

M00003875C:G07 

M00003875D:D1 1 

M00001549B:F06 

M00001553D:D10 

M00001402A:E08 

M00001537A.F12 

M00001340D.F10 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 
SEQIDNO: Cluster ID 

256 4937 

257 39412 

258 4837 

259 13046 

260 4141 

261 38085 
262 

263 11451 

264 14507 

265 40054 

266 39423 

267 39453 

268 10751 

269 10751 

270 78091 

271 39539 
272 

273 
274 

275 9283 

276 39168 

277 39458 

278 14391 

279 39195 

280 7212 

281 4015 

282 12977 
283 

284 14391 

285 

286 

287 8358 

288 1267 
289 

290 16347 

291 7899 

292 2379 

293 39478 

294 39392 

295 5053 

296 16999 

297 39180 

298 1037 



Sequence Name 

RTA00000191AF.p.21.1 

RTA00000133A.k.l7.1 

RTA00000185AR.k.3.2 

RTA00000193AF.h.l9.1 

RTA00000177AF.p.20.3 

RTA00000123A.e.l5.1 

RTAOOOOO 1 89 AF.p.8.1 

RTA00000192AF.p.l7.1 

RTA00000189AR.1.23.2 

RTA00000180AF.p.l0.1 

RTA00000134A.k.22.1 

RTA00000135A.g.ll.l 

RTA00000187AF.k.7.1 

RTA00000187AF.k.6.1 

RTA00000187AF.j.6.1 

RTAOOOOO 127A.i.2 1.1 

RTA00000182AF.1.15.1 

RTA00000194AF.d.l3.1 

RTA00000128A.C.20.1 

RTA0O000 1 8 1 AR.m.22.2 

RTA00000121A.1.10.1 

RTA00000126A.p.l5.2 

RTA00000 1 77AF.m. 17.3 

RTA00000137A.C.16.1 

RTA00000193AF.b.l4.1 

RTA00000136A.e.l2.1 

RTA00000189AF.J.19.1 

RTA00000178AF.m.l3.1 

RTA00000191AF.1.7.1 

RTAOOOOO 1 94 AF.c.23 . 1 

RTA00000181AF.b.7.1 

RTA00000183AF.i.5.1 

RTA00000125A.O.5.1 

RTA00000189AF.f.7.1 

RTA000001 84AF.e. 15. 1 

RTA00000193AF.a.l7.1 

RTA00000178AF.a.6.1 

RTA00000133A.i.5.1 

RTA00000134A.m.l6.1 

RTAOOOOO 1 84AF.0. 12. 1 

RTA00000185AF.k.9.1 

RTAOOOOO 126A.n.8.2 

RTA00000121A.f.8.1 
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PCTAJS98/27610 
Clone Name 

M00004108A-.E06 

M00001511A:H06 

M00001597C:H02 

M00004296C:H07 

M00001361A:A05 

M00001531A:D01 

M00003891C:H09 

M00004214C:H05 

M00003879D:A02 

M00001439C:F08 

M00001535A-.F10 

M00001542A:E06 

M00001679D:D03 

M00001679D:D03 

M00001679C:F01 

M00001555A:B02 

M00001487B:H06 

M00004896A:C07 

M00001558A-.H05 

M00001455D:F09 

M00001507A:H05 

M00001552A:D11 

M00001355B:G10 

M00001555A:C01 

M00004230B:C07 

M00001549A:B02 

M00003875B:F04 

M00001384B:A11 

M00004081C:D12 

M00004691D:A05 

M00001443B:F01 

M00001528B:H04 

M00001546A:G11 

M00003851B-.D08 

M00001549C:E06 

M00004223B:D09 

M00001361D:F08 

M00001471A:B01 

M00001536A:C08 

M00001564A:B12 

M00001598A-.G03 

M00001551A-.F05 

M00001470A:B10 



i 



WO 99/33982 
SEQ ID NO: Cluster 

299 6867 

300 10539 

301 41633 

302 23218 

303 39380 
304 

305 

306 6974 
307 

308 15855 

309 84328 

310 10145 

311 39805 
312 

3 1 1 aooo 



314 

315 

316 

317 

318 

319 

320 

321 

322 

323 

324 

325 

326 

327 

328 

329 

330 

331 

332 

333 

334 

335 

336 

337 

338 

339 

340 

341 



14367 

16921 

1577 

8773 

39886 
13532 

5257 

9061 

19267 

20212 

16653 

16985 

12977 

9061 

6468 

16392 

14627 

6583 

6806 

9635 

689 

4119 

8952 

2379 



n> Sequence Name 

RTA00000178AF.e.l2.1 

RTA00000183AF.a.24.1 

RTA00000118A.g.l6.1 

RTA00000187AR.C.5.2 

RTA00000129A.e.24.1 

RTA00000185AF.d.24.1 

RTA00000177AF.O.4.3 

RTA00000184AF.a.l5.1 

RTA00000185AF.g.ll.l 

RTA00000184AF.j.l.l 

RTA00000118A.p.l0.1 

RTA00000120A.g.l2.1 

RTA00000177AF.C.21.3 

RTA00000187AF.h.23.1 



iv j /wuuuu 1 8 / AK. 1. 1 0.2 
RTA00000187AF.e.8.1 
RTA00000193AF.C.22.1 
RTA00000183AF.k.6.1 
RTA00000184AF.L23.1 
RTA00000187AF.f.24.1 
RTA00000194AF.a.ll.l 
RTA00000178AF.j.24.1 
RTA00000181AF.C.4.1 
RTA00000193AF.d.2.1 
RTA00000192AF.f.3.1 
RTA00000191AR.e.ll.2 
RTA00000186AF.I.12.1 
RTA00000134A.1.22.1 
RTA00000181AF.k.5.3 
RTA00000177AF.h.l0.1 
RTA00000189AR.j.l9.1 
RTA00000191AR.e.ll.3 
RTA00000194AR.a.l0.2 
RTA00000187AF.d.l5.1 
RTA00000192AF.1.1.1 
RTA00000187AF.g.23.1 
RTA00000179AF.d.l3.1 
RTA00000177AF.g.l3.3 
RTA00000137A.e.23.4 
RTA00000181AR.1.22.1 
RTA00000183AF.k.l6.1 
RTA00000183AF.h.l5.I 
RTA00000192AF.p.8.1 
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PCT/US98/27610 
Clone Name 

M00001370A:C09 

M00001499B:A1 1 

M00001449A.B12 

M00001662C:A09 

M00001587A:B11 

M00001582D:F05 

M00001358C:C06 

M00001544B:B07 

M00001590B:F03 

M00001556A.H01 

M00001452A:B04 

M00001465A.-B11 

M00001342B.E06 

M00001679A.F06 

M00001679B:F01 

M00001670C.H02 

M00004249D:G12 

M00001534A:C04 

M00001556A:F11 

M00001675A:C09 

M00004461A:B09 

M00001380D:B09 

M00001445A:F05 

M00004251C:G07 

M00004146C.-C11 

M00004031A:A12 

M00001645A:C12 

M00001535A.C06 

M00001453C.F06 

M00001348B:G06 

M00003875B.-F04 

M00004031A:A12 

M00004461A:B08 

M00001669B.F02 

M00004183C:D07 

M00001677C.-E10 

M00001394A.F01 

M00001346D.E03 

M00001557A.F01 

M00001454D.G03 

M00001534C:A01 

M00001518C:B11 

M00004212B:C07 



BNSDOCID: <WO_9933982A2_l_> 



WO 99/33982 
SEQ ID NO: Cluster ID 

342 39486 

343 21877 

344 6874 

345 6874 

346 37285 
347 

348 7674 

349 2797 
350 

351 7802 

352 16921 

353 11494 

354 17062 

355 16245 

356 83103 

357 4309 

358 13072 

359 4059 

360 5178 

361 H20 

362 6420 

363 13913 
364 

365 3956 

366 14364 

367 6880 

368 84182 

369 2790 

370 4561 

371 8847 

372 56020 

373 1531 

374 6420 
375 

376 
377 

378 2696 

379 260 

380 H350 

381 2428 

382 4313 
383 

384 697 



Sequence Name 

RTA00000128A.m.22.2 

RTA00000189AF.b.21.1 

RTA00000192AF.a.l4.1 

RTA00000189AF.e.9.1 

RTA00000191AF.f.ll.l 

RTA00000193AF.j.20.1 

RTA00000118A.g.9.1 

RTA00000180AF.i.l9.1 

RTA00000184AF.g.22.1 

RTA00000185AF.n.5.1 

RTA00000193AF.h.l5.1 

RTA00000192AF.j.6.1 

RTA00000177AF.b.8.4 

RTA00000177AF.k.9.3 

RTA00000119A.e.24.2 

RTA000001 86AF.e.22.1 

RTA00000 1 8 1 AR.m.5.2 

RTA00000177AF.n.l8.3 

RTA00000178AF.n.l0.1 

RTA00000118A.p.l5.3 

RTA00000183AF.d.ll.l 

RTA00000186AF.e.6.1 

RTA00000192AF.C.2.1 

RTA00000183AF.g.3.1 

RTA00000183AF.g.l2.1 

RTA00000191AF.m.20.1 

RTA000001 80AF.h.l 9. 1 

RTA00000177AF.e.2.1 

RTA00000184AF.i.21.1 

RTA00000180AF.b-16.1 

RTA00000193AF.g.2.1 

RTA00000119A.O.3.1 

RTA00000177AF.f.l0.3 

RTA00000188AF.b.l2.1 

RTA00000180AF.k.24.1 

RTA00000184AF.a.8.1 

RTA00000134A-m.l3.1 

RTA00000185AR.i.l2.2 

RTA000001 89AF.a.24.2 

RTA00000123A.1.21.1 

RTA00000122A.n.3.1 

RTA00000184AF.p.3.1 

RTA00000188AF.d.6.1 
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PCTAJS98/27610 
Clone Name 
M00001561A:C05 
M00003833A.E05 
M00004111D:A08 
M00003846B:D06 
M00004035C:A07 
M00004327B:H04 
M00001416A:H01 
M00001429A-.H04 
M00001552D-.A01 
M00001608A-.B03 
M00004295D:F12 
M00004172C:D08 
M00001340B:A06 
M00001352A:E02 
M00001454A:A09 
M00001624C-.F01 
M00001455B-.E12 
M00001357D-.D11 
M00001386C:B12 
M00001452A:D08 
M00001504D:G06 
M00001623D:F10 
M00004121B:G01 
M00001512D:G09 
M00001513C:E08 
M00004087D:A01 
M00001428A-.H10 
M00001343C:F10 
M00001555D:G10 
M00001416B:H11 
M00004285B:E08 
M00001461A:D06 
M00001345A:E01 
M00003754C:E09 
M00001432C:F06 
M00001544A:E06 
M00001536A:B07 
M00001594B-.H04 
M00003826B-.A06 
M00001533A-.C11 
M00001517A:B07 
M00001S66B:D11 
M00003759B:B09 



WO 99/33982 
SEQ ID NO: Cluster 



385 

386 

387 

388 

389 

390 

391 

392 

393 

394 

395 

396 

397 

398 

399 

400 

401 

402 

403 

404 

405 

406 

407 

408 

409 

410 

411 

412 

413 

414 

415 

416 

417 

418 

419 

420 

421 

422 

423 

424 

425 

426 

427 



5619 
4568 

5622 

7514 

5619 

7570 

23358 

23210 

5192 

13538 



14929 
4908 



17732 

17062 

11589 

4443 

39805 

2790 

23255 

6420 

5007 

13576 

16927 

16985 

3584 

7187 

16245 

8078 

14929 

14391 

4141 

2379 

5622 

945 

40132 



ID Sequence Name 

RTA00000188AF.1.9.1 

RTA00000122A.d.l5.3 

RTA00000177AF.i.6.2 

RTA00000178AF.a.ll.l 

RTA00000184AF.k.21.I 

RTA00000189AF.f.l7.l 

RTA00000187AF.g.24.1 

RTA00000190AF.O.21.1 

RTA00000190AF.O.20.1 

RTA00000184AF.L2.1 

RTA00000180AF.a.24.1 

RTA00000189AF.h.I7.1 

RTA00000192AF.O.11.1 

RTA00000184AF.1.11.1 

RTA00000 1 89AF.g.5. 1 

RTA00000177AF.m.l.2 

RTA00000192AF.j.2.1 

RTA00000178AF.k.l6.1 

RTA00000194AF.C.24.1 

RTA00000178AR.i.2.2 

80.Al.sp6:130208.Seq 

80.Bl.sp6:130220.Seq 

80.Cl.sp6:130232.Seq 

80.Dl.sp6:130244.Seq 

80.El.sp6:130256.Seq 

80.Fl.sp6.130268.Seq 

80.Gl.sp6:130280.Seq 

80.Hl.sp6:130292.Seq 

80.D2.sp6.130245.Seq 

80.E2.sp6:130257.Seq 

80.F2.sp6.130269.Seq 

80.G2.sp6:130281.Seq 

80.H2.sp6:130293.Seq 

80.A3.sp6:130210.Seq 

80.D3.sp6:130246.Seq 

80.E3.sp6:130258.Seq 

80.F3.sp6:130270.Seq 

80.G3.sp6:130282.Seq 

80.B4.sp6:130223.Seq 

80.C4.sp6:130235.Seq 

80.D4.sp6:130247.Seq 

80.E4.sp6:130259.Seq 

80.F4.sp6:130271.Seq 
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PCT/US98/27610 
Clone Name 

M00003796C.D05 

M00001513A.B06 

M00001350A:B08 

M00001362B:D10 

M00001558B.H11 

M00003853A:D04 

M00001677D.A07 

M00003974D.H02 

M00003974D:E07 

M00001557B:H10 

M00001415A:H06 

M00003867A:D10 

M00004205D:F06 

M00001559B.F01 

M00003857A.H03 

M00001353D.D10 

M00004171D.B03 

M00001381D.E06 

M00004692A:H08 

M00001376B:G06 

M00001340B:A06 

M00001340D:F10 

M00001341A:E12 

M00001342B.E06 

M00001343C.F10 

M00001343D.H07 

M00001345A.E01 

M00001346A:F09 

M00001347A:B10 

M00001348B:B04 

M00001348B:G06 

M00001349B:B08 

M00001350A.B08 

M00001350A.H01 

M00001352A:E02 

M00001353A:G12 

M00001353D:D10 

M00001355B:G10 

M00001361A:A05 

M00001361D:F08 

M00001362B.D10 

M00001362C.H1 1 

M00001365C.C10 



BNSOOCID: <WO__9933982A2_l_> 



WO 99/33982 
SEQ ID NO: Cluster ID 

428 

429 6867 

430 7172 

431 17732 

432 39833 

433 1334 

434 39886 
435 

436 22979 

437 39648 
438 

439 5178 

440 2464 

441 7587 

442 5832 

443 16269 

444 6583 

445 4009 
446 

447 39563 

448 5556 

449 9577 

450 7005 

451 8551 
452 

453 

454 13538 

455 8847 

456 36393 

457 9952 

458 9577 

459 15066 

460 10470 

461 22195 
462 

463 4261 

464 84182 

465 40392 

466 16731 
467 

468 13532 

469 8 

470 36313 



Sequence Name 

80.G4.sp6:130283.Seq 

80.H4.sp6:130295.Seq 

80.A5.sp6:130212.Seq 

80.B5.sp6:130224.Seq 

80.C5.sp6:130236.Seq 

80.D5.sp6:130248.Seq 

80.E5.sp6:130260.Seq 

80.F5.sp6:130272.Seq 

80.G5.sp6:130284.Seq 

80.H5.sp6:130296.Seq 

80.B6.sp6:130225.Seq 

80.C6.sp6:130237.Seq 

80.D6.sp6:130249.Seq 

80.E6.sp6:130261.Seq 

80.F6.sp6:l 30273. Seq 

80.G6.sp6:130285.Seq 

80.H6.sp6:130297.Seq 

80.A7.sp6:130214.Seq 

80.B7.sp6:130226.Seq 

80.C7.sp6:130238.Seq 

80.D7.sp6:130250.Seq 

80.E7.sp6:130262.Seq 

80.F7.sp6:130274.Seq 

80.G7.sp6:130286.Seq 

80.H7.sp6:130298.Seq 

80.A8.sp6:130215.Seq 

80.B8.sp6:130227.Seq 

80.C8.sp6:130239.Seq 

80.D8.sp6: 130251. Seq 

80.E8.sp6: 130263. Seq 

80.G8.sp6:130287.Seq 

80.H8.sp6:130299.Seq 

80.A9.sp6: 13021 6.Seq 

80.B9.sp6:130228.Seq 

80.C9.sp6:130240.Seq 

80.D9.sp6:130252.Seq 

80.E9.sp6:130264.Seq 

80.H9.sp6:130300.Seq 

80.C10.sp6: 130241. Seq 

80.D10.sp6:130253.Seq 

80.E10.sp6:130265.Seq 

80.H10.sp6:130301.Seq 

80.All.sp6:130218.Seq 
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PCT/US98/27610 
Clone Name 

M00O01368D:EO3 

M00001370A:C09 

M00001371C:E09 

M00001376B:G06 

M00001378B:B02 

M00001379A:A05 

M00001380D:B09 

M00001381D:E06 

M00001382C:A02 

M00001383A:C03 

M00001384B:A11 

M00001386C-.B12 

M00001387A:C05 

M00001387B:GO3 

M00001388D:G05 

M00001389A:C08 

M00001394A:F01 

M00001396A:C03 

M00001400B-.H06 

M00001402A-.E08 

M00001407B:D11 

M00001409C:D12 

M00001410A:D07 

M00001412B:B10 

M00001414A:B01 

M00001414C-.A07 

M00001415A:H06 

M00001416B:H11 

M00001417A:E02 

M00001418B:F03 

M00001421C:F01 

M00001423B:E07 

M00001424B:G09 

M00001425B:H08 

M00001426B:D12 

M00001426D:C08 

M00001428A:H10 

M00001429D:D07 

M00001442C:D07 

M00001443B:F01 

M00001445A:F05 

M00001448D-.C09 

M00001448D:H01 



WO 99/33982 
SEQ ID NO: Cluster 

471 5857 

472 41633 

473 36535 

474 86110 

475 32663 

476 27250 

477 16970 

478 16130 

479 16653 

480 7005 

481 13072 

482 9283 

483 23255 

484 13576 

485 71 «7 

486 14391 

487 945 

488 7172 

489 39648 

490 84182 

491 8 

492 36535 

493 82498 

494 16970 

495 16130 

496 7005 
497 

498 18957 

499 40044 

500 5214 

501 6660 

502 13183 

503 6455 

504 5319 

505 H443 
506 

507 8210 

508 9457 

509 6420 

510 16245 

511 8078 

512 14929 

513 14391 



H) Sequence Name 

80.Bll.sp6:130230.Seq 

80.Cll.sp6:130242.Seq 

80.Dll.sp6:130254.Seq 

80.Ell.sp6:130266.Seq 

80.Fll.sp6:130278.Seq 

80.Gll.sp6:130290.Seq 

80.Hll.sp6:130302.Seq 

80.A12.sp6:130219.Seq 

80.B12.sp6:130231.Seq 

80.C12.sp6:l 30243. Seq 

8O.F12.sp6:130279.Seq 

80.G12.sp6:130291.Seq 

100.Cl.sp6:131446.Seq 

100.El.sp6:131470.Seq 

1 f\f\ /"^""» f - 1 n 1 A nt 

100.E3.sp6:131472.Seq 

100.E4.sp6:131473.Seq 

100.A5.sp6:131426.Seq 

100.A6.sp6.131427.Seq 

100.G9.sp6:131502.Seq 

100.Bll.sp6:131444.Seq 

100.Dll.sp6:131468.Seq 

100.Fll.sp6:131492.Seq 

100.C12.sp6:131457.Seq 

100.D12.sp6:131469.Seq 

121.Dl.sp6:131917.Seq 

121.G6.sp6:131958.Seq 

121. F7.sp6:131947.Seq 

122. El.sp6.132121.Seq 
122.C2.sp6:132098.Seq 

122. B5.sp6:132089.Seq 

123. D5.sp6:132305.Seq 
123.E7.sp6:132319.Seq 
123.F7.sp6:132331.Seq 
123.A8.sp6:132272.Seq 
123.C8.sp6:132296.Seq 
123.E8.sp6:132320.Seq 
123.Dll.sp6: 1323 11. Seq 
172.El.sp6:133925.Seq 
172.D2.sp6:133914.Seq 
172.C3.sp6:133903.Seq 
172.D3.sp6:133915.Seq 
172.H3.sp6:133963.Seq 
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PCT/US98/276I0 
Clone Name 

M00001449A:A12 

M00001449A:B12 

M00001449A:G10 

M00001449C:D06 

M00001450A:A11 

M00001450A:D08 

M00001452C:B06 

M00001453A:E11 

M00001453C:F06 

M00001454B:C12 

M00001455B.-E12 

M00001455D:F09 

M00001343D:H07 

M00001347A.B10 

M0O0O1350A:H01 

M00001355B:G10 

M00001362C.H11 

M00001371C:E09 

M00001383A.C03 

M00001428A:H10 

M00001448D:C09 

M00001449A.G10 

M00001450A:B12 

M00001452C:B06 

M00001453A:E11 

M00001454B:C12 

M00001506D.A09 

M00001528A:F09 

M00001621C.C08 

M00001630B:H09 

M00001679A:A06 

M00004114C.F11 

M00004157C:A09 

M00004169C:C12 

M00004185C:C03 

M00004191D:BU 

M00004197D:H01 

M00004307C:A06 

M00001345A:E01 

M00001352A.E02 

M00001353A:G12 

M00001353D:D10 

M00001355B:G10 



WO 99/33982 
SEQ ID NO: Cluster ID 

514 6583 

515 4009 
516 

517 

518 19267 

519 78091 

520 17055 

521 6539 
522 

523 5257 

524 1 1494 
525 

526 H451 

527 9283 

528 16283 

529 10539 

530 6420 

531 3956 
532 

533 1577 

534 9635 

535 5192 

536 6539 

537 945 

538 7005 

539 39304 

540 27250 

541 35555 

542 19255 

543 6268 

544 3277 

545 39171 

546 H460 

547 10539 

548 H476 

549 4016 

550 7674 

551 3681 

552 39304 

553 82498 

554 84328 

555 86859 

556 1120 



Sequence Name 

172.B8.sp6:133896.Seq 

172.D8.sp6:133920.Seq 

172.B9.sp6:133897.Seq 

176.A3.sp6:134514.Seq 

176.G3.sp6:134586.Seq 

176.G5.sp6:134588.Seq 

176.D6.sp6:134553.Seq 

176. D9.sp6:134556.Seq 

177. H4.sp6:l 34791. Seq 
177.F5.sp6:134768.Seq 
177.E6.sp6:134757.Seq 
177.G7.sp6:134782.Seq 
177.D8.sp6:134747.Seq 
173.D2.SP6:134106.Seq 
173.F3.SP6:134131.Seq 
173.B5.SP6:134085.Seq 
173.F5.SP6:134133.Seq 
173.H5.SP6:134157.Seq 
173.G7.SP6:134147.Seq 
173.C9.SP6:134101.Seq 
173.D9.SP6:134113.Seq 
173.E9.SP6:134125.Seq 
173.A12.SP6:134080.Seq 
180.C2.sp6:135940.Seq 
180.H5.sp6:136003.Seq 
180.G9.sp6:135995.Seq 
180.B10.sp6:135936.Seq 
184.A5.sp6:135530.Seq 
184.B10.sp6:135547.Seq 
184.C12.sp6:135561.Seq 
217.El.sp6:139406.Seq 
217.A12.sp6:139369.Seq 
219.F2.sp6:139035.Seq 
219.F6.sp6:139039.Seq 
219.H8.sp6:139065.Seq 
79.Al.sp6:130016.Seq 
79.Cl.sp6:130040.Seq 
79.El.sp6:130064.Seq 
79.Fl.sp6:130076.Seq 
79.Gl.sp6:130088.Seq 
79.A2.sp6:130017.Seq 
79.B2.sp6:130029.Seq 
79.C2.sp6: 130041. Seq 
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PCT/US98/27610 
Clone Name 

M00001394A:F01 

M00001396A:C03 

M00001400B:H06 

M00001632D:H07 

M00001645A:C12 

M00001679C:F01 

M00001682C:B12 

M00003844C:B11 

M00004121B:G01 

M00004146C:C11 

M00004172C:D08 

M00004205D.F06 

M00004214C-.H05 

M00001455D:F09 

M00001467A:D08 

M00001499B:A11 

M00001504D:G06 

M00001512D:G09 

M00001544A:E06 

M00001556A:F11 

M00001557A:F01 

M00001557B-.H10 

M00001579D:C03 

M00001362C:H1 1 

M00001410A:D07 

M00001450A-.A02 

M00001450A:D08 

M00001528A:C04 

M00001545A:C03 

M00001551A:B10 

M00001624A-.B06 

M00001644C:B07 

M00001676B:F05 

M00001680D:F08 

M00003747D:C05 

M00001395A-.C03 

M00001416A:H01 

M00001449A:D12 

M00001450A:A02 

M00001450A:B12 

M00001452A:B04 

M00001452A:B12 

M00001452A:D08 



WO 99/33982 
SEQ ID NO: Cluster 



557 


85064 


558 


83103 


559 


10145 


560 


16283 


561 


4568 


562 


4313 


563 


2428 


564 


39423 


565 


39174 


566 


22113 


567 


19829 


568 


13864 


569 


1058 


570 


4015 







572 

573 

574 

575 

576 

577 

578 

579 

580 

581 

582 

583 

584 

585 

586 

587 

588 

589 

590 

591 

592 

593 

594 

595 

596 

597 

598 

599 



307 

39458 

39490 

39486 

39380 

1399 

39391 

6268 

2448 
1531 
19 

38759 
39508 
16283 
39442 
7589 

12081 

19105 

1037 

39425 

39478 



18699 

7206 

2623 



ED Sequence Name 

79.D2.sp6:130053.Seq 

79.G2.sp6:130089.Seq 

79.F3.sp6:130078.Seq 

79.H3.sp6:] 30 102.Seq 

79.D4.sp6:130055.Seq 

79.F4.sp6:130079.Seq 

79.A5.sp6:130020.Seq 

79.C5.sp6:130044.Seq 

79.E5.sp6:130068.Seq 

79.F5.sp6:130080.Seq 

79.H5.sp6:I30104.Seq 

79.B6.sp6:l 30033. Seq 

79.F6.sp6.I30081.Seq 

79.G6.sp6.130093.Seq 

79.A7.sp6:130022.Seq 

79.C7.sp6.130046.Seq 

79.D7.sp6:130058.Seq 

79.G7.sp6:130094.Seq 

79.B8.sp6:130035.Seq 

79.E8.sp6: 130071. Seq 

79.G8.sp6:130095.Seq 

79.A9.sp6:130024.Seq 

79.G9.sp6.130096.Seq 

377.F4.sp6:141957.Seq 

89.Al.sp6:130667.Seq 

89.Cl.sp6: 130691. Seq 

89.Dl.sp6: 130703 .Seq 

89.Fl.sp6:130727.Seq 

89.Gl.sp6:130739.Seq 

89.Hl.sp6: 13075 l.Seq 

89.A2.sp6:130668.Seq 

89.B2.sp6:130680.Seq 

89.C2.sp6:130692.Seq 

89.D2.sp6:130704.Seq 

89.E2.sp6:130716.Seq 

89.F2.sp6:130728.Seq 

89.G2.sp6:130740.Seq 

89.H2.sp6.130752.Seq 

89.B3.sp6:130681.Seq 

89.C3.sp6:130693.Seq 

89.D3.sp6:130705.Seq 

89.E3.sp6:130717.Seq 

89.F3.sp6:130729.Seq 
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PCT/US98/27610 
Clone Name 

M00001452A.F05 

M00001454A.A09 

M00001465A.B11 

M00001467A.D08 

M00001513A.B06 

M00001517A.B07 

M00001533A.C11 

M00001535A:F10 

M00001541A:H03 

M00001542A:A09 

M00001544A:G02 

M00001545A:D08 

M00001548A:H09 

M00001549A:B02 

M00001551A:F05 

M00001552A:B12 

M00001552A:D11 

M00001557A:F03 

M00001561A:C05 

M00001587A:B11 

M00001604A.B10 

M00001604A.F05 

M00001551A:B10 

M00004692A.H08 

M00001460A:F06 

M00001461A.D06 

M00001463C.B11 

M00001467A:B07 

M00001467A.D04 

M00001467A.D08 

M00001467A:E10 

M00001468A.-F05 

M00001469A.A01 

M00001469A:C10 

M00001469A:H12 

M00001470A:B10 

M00001470A:C04 

M00001471A.B01 

M00001487B.H06 

M00001488B:F12 

M00001490B.C04 

M00001494D:F06 

M00001497A.G02 



BNSDOCID: <WO__993398ZA2 I 



WO 99/33982 
SEQIDNO: Cluster D) Sequence Name 



600 


10539 


89.G3.sp6:130741.Seq 


601 


5336 


89.H3.sp6:130753.Seq 


602 


2623 


89.A4.sp6:130670.Seq 


603 


9443 


89.B4.sp6:130682.Seq 


604 


9685 


89.C4.sp6:130694.Seq 


605 




89.D4.sp6:130706.Seq 


606 


10185 


89.E4.sp6:130718.Seq 


607 


6974 


89.F4.sp6:130730.Seq 


608 


6420 


89.G4.sp6:130742.Seq 


609 




89.H4.sp6:130754.Seq 


610 




89.A5.sp6:130671.Seq 


611 


39168 


89.B5.sp6:130683.Seq 


612 


39412 


89.C5.sp6:130695.Seq 


613 


39186 


89.D5.sp6:130707.Seq 


614 


3956 


89.E5.sp6:130719.Seq 


615 




89 .F5.sp6: 13073 l.Seq 



OlO 


14364 


89.G5.sp6:130743.Seq 


0 1 / 


40044 


89.H5.sp6:130755.Seq 


OlO 


8952 


89.A6.sp6:130672.Seq 




35555 


89.B6.sp6:130684.Seq 




18957 


89.C6.sp6:130696.Seq 


621 


8358 


89.D6.sp6:130708.Seq 


622 


38085 


89.E6.sp6:130720.Seq 


623 




89.F6.sp6:130732.Seq 


624 


3990 


89.G6.sp6:130744.Seq 


625 


16921 


89.H6.sp6:130756.Seq 


626 


5321 


89.B7.sp6:130685.Seq 


627 


4119 


89.C7.sp6:130697.Seq 


628 


20212 


89.E7.sp6:130721.Seq 


629 


2696 


89.F7.sp6:130733.Seq 


630 


39392 


89.G7.sp6:130745.Seq 


631 


39420 


89.H7.sp6:130757.Seq 


632 


3389 


89.A8.sp6:130674.Seq 


633 


8286 


89.B8.sp6:130686.Seq 


634 


3765 


89.C8.sp6:130698.Seq 


635 


39453 


89.E8.sp6:130722.Seq 


636 




89.F8.sp6:130734.Seq 


637 




89.H8.sp6:130758.Seq 


638 


6974 


89.A9.sp6:130675.Seq 


639 




89.B9.sp6:130687.Seq 


640 


19255 


89.C9.sp6:130699.Seq 


641 


1267 


89.D9.sp6: 13071 l.Seq 


642 


5892 


89.E9.sp6:130723.Seq 
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PCT/US98/27610 
Clone Name 
M00001499B:AU 
M00001500A:C05 
M00001500A:E11 
M00001500C:E04 
M00001501D:C02 
M00001504A:E01 
M00001504C:A07 
M00001504C:H06 
M00001504D-.G06 
M00001505C:C05 
M00001506D:A09 
M00001507A:H05 
M00001511A:H06 
M00001512A:A09 
M00001512D-.G09 
M00001513B:G03 
M00001513C:E08 
M00001514C:D11 
M00001518C:B11 
M00001528A:C04 
M00001528A:F09 
M00001528B:H04 
M00001531A:D01 
M00001531A:H11 
M00001532B:A06 
M00001534A:C04 
M00001534A:F09 
M00001534C:A01 
M00001535A:C06 
M00001536A:B07 
M00001536A:C08 
M00001537A:F12 
M00001537B:G07 
M00001540A:D06 
M00001541A:D02 
M00001542A:E06 
M00001542B:B01 
M00001544A:E06 
M00001544B-.B07 
M00001545A-.B02 
M00001545A:C03 
M00001546A:G11 
M00001548A:E10 




WO 99/33982 



SEQ ID NO: Cluster ED Sequence Name 



643 


4193 


89.G9.sp6:130747.Seq 


644 


16347 


89.H9.sp6:130759.Seq 


645 


7239 


89.A10.sp6:130676.Seq 


646 


5175 


89.B10.sp6:130688.Seq 


647 


22390 


89.C10.sp6:130700.Seq 


648 


3266 


89.D10.sp6:130712.Seq 


649 


5708 


89.E10.sp6:130724.Seq 


650 




89.F10.sp6:130736.Seq 


651 


8298 


89.G10.sp6:130748.Seq 


652 


4573 


89.H10.sp6:130760.Seq 


653 


22814 


89.All.sp6:130677.Seq 


654 


39539 


89.Bll.sp6.130689.Seq 


655 


39195 


89.Cll.sp6:130701.Seq 


656 


4561 


89.Dll.sp6:130713.Seq 


657 


9244 


89.Ell.sp6:130725.Seq 


658 


1577 


89.Fll.sp6:130737.Seq 


659 


4386 


89.Hll.sp6:130761.Seq 


660 


11294 


89.A12.sp6:130678.Seq 


661 


5192 


89.D12.sp6:130714.Seq 


662 


8761 


89.E12.sp6:130726.Seq 


663 




89.F12.sp6:130738.Seq 


664 


7514 


89.G12.sp6:130750.Seq 


665 




89.H12.sp6:130762.Seq 


666 


6558 


90.Al.sp6:130859.Seq 


667 


102 


90.Bl.sp6:130871.Seq 


668 




90.Dl.sp6:130895.Seq 


669 


5749 


90.El.sp6:130907.Seq 


670 


6539 


90.Gl.sp6:130931.Seq 


671 


6293 


90.A2.sp6:130860.Seq 


672 




90.C2.sp6:130884.Seq 


673 


260 


90.D2.sp6:130896.Seq 


674 


4837 


90.E2.sp6:130908.Seq 


675 


10470 


9O.F2.sp6:130920.Seq 


676 


16999 


90.G2.sp6:130932.Seq 


677 


22794 


90.H2.sp6:130944.Seq 


678 


11465 


90.A3.sp6:130861.Seq 


679 


7802 


90.B3.sp6:130873.Seq 


680 


22155 


90.C3.sp6:130885.Seq 


681 




90.D3.sp6:130897.Seq 


682 


13157 


90.E3.sp6:130909.Seq 


683 


17004 


90.F3.sp6:130921.Seq 


684 


40314 


90.G3.sp6:130933.Seq 


685 


40044 


90.H3.sp6:130945.Seq 
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PCT/US98/27610 
Clone Name 
M00001549B:F06 
M00001549C:E06 
MO0OO155OA:AO3 
M00001550A:G01 
M00001551A:G06 
M00001551C:G09 
M00001552B:D04 
M00001552D:A01 
M00001553A:H06 
M00001553B:F12 
M00001553D:D10 
M00001555A:B02 
M00001555A:C01 
M00001555D:G10 
M00001556A:C09 
M00001556A.F1 1 
M00001556B.C08 
M00001556B:G02 
M00001557B:H10 
M00001557D:D09 
M00001558A.H05 
M00001558B:H11 
M00001559B:F01 
M00001560D:F10 
M00001563B:F06 
M00001566B:D11 
M00001571C:H06 
M00001579D:C03 

M00001583D:A10 

M00001590B:F03 

M00001594B:H04 

M00001597C:H02 

M00001597D.C05 

M00001598A:G03 

M00001601A:D08 

M00001607A:E11 

M00001608A:B03 

M00001608B:E03 

M00001608D:A11 

M00001614C:F10 

M00001617C:E02 

M00001619C:F12 

M00001621C:C08 



BNSDOCID: <WO_9933982A2_l_> 



WO 99/33982 
SEQ ID NO: Cluster ID Sequence Name 



686 


13913 


90.A4.sp6:130862.Seq 


687 


3277 


90.B4.sp6:130874.Seq 


688 


4309 


90.C4.sp6:130886.Seq 


689 


5214 


90.D4.sp6:130898.Seq 


690 




90.E4.sp6:130910.Seq 


691 


39171 


90.F4.sp6:130922.Seq 


692 


19267 


90.G4.sp6:130934.Seq 


693 


4665 


90.H4.sp6:130946.Seq 


694 




90.A5.sp6:130863.Seq 


695 


23201 


90.B5.sp6:130875.Seq 


696 


76760 


90.C5.sp6:130887.Seq 


697 


23218 


90.D5.sp6:130899.Seq 


698 


35702 


90.E5.sp6:130911.Seq 


699 


6468 


90.F5.sp6:130923.Seq 


700 


14367 


90.G5.sp6:130935.Seq 


701 


7015 


90.H5.sp6:130947.Seq 


702 


8773 


90.A6.sp6:130864.Seq 


703 


11460 


90.B6.sp6:130876.Seq 


704 


7570 


90.D6.sp6:130900.Seq 


705 


4416 


90.E6.sp6:130912.Seq 


706 


6660 


90.F6.sp6:130924.Seq 


707 




90.H6.sp6:130948.Seq 



708 


26875 


90.A7.sp6:130865.Seq 


709 


6298 


90.B7.sp6:130877.Seq 


710 


78091 


90.C7.sp6:130889.Seq 


711 


10751 


90.D7.sp6:130901.Seq 


712 


10539 


90.F7.sp6:130925.Seq 


713 


17055 


90.G7.sp6:130937.Seq 


714 


5382 


90.A8.sp6:130866.Seq 


715 


4393 


90.B8.sp6:130878.Seq 


716 


67252 


90.C8.sp6:130890.Seq 


717 


40108 


90.D8.sp6:130902.Seq 


718 


11476 


90.E8.sp6:130914.Seq 


719 




90.F8.sp6:130926.Seq 


720 


697 


90.G8.sp6:130938.Seq 


721 




90.H8.sp6:130950.Seq 


722 


17076 


90.A9.sp6:130867.Seq 


723 


3108 


90.B9.sp6:130879.Seq 


724 


67907 


90.C9.sp6:130891.Seq 


725 




90.D9.sp6:130903.Seq 


726 


11350 


90.F9.sp6:130927.Seq 


727 


7899 


90.H9.sp6:130951.Seq 


728 


7798 


90.A 1 0.sp6: 1 30868.Seq 



PCT/US98/27610 
Clone Name 
M00001623D:F10 
M00001624A:B06 
M00001624C:F01 
M00001630B:H09 
M00001632D:H07 
M00001644C:B07 
M00001645A:C12 
M00001648C:A01 
M00001651A:H01 
M00001657D:C03 
M00001657D:F08 
M00001662C:A09 
M00001663A:E04 
M00001669B:F02 
M00001670C:H02 
M00001673C:H02 
M00001675A:C09 
M00001676B:F05 
M00001677D:A07 
M00001678D:F12 
M00001679A:A06 
M00001679A-.F06 
M00001679A:F10 
M00001679B:F01 
M00001679C:F01 
M00001679D:D03 
M00001680D-.F08 
M00001682C:B12 
M00001688C-.F09 
M00001693C:G01 
M00001716D:H05 
M00003741D-.C09 
M00003747D:C05 
M00003754C-.E09 
M00003759B:B09 
M00003761D.A09 
M00003762C:B08 
M00003763A:F06 
M00003774C:A03 
M00003784D-.D12 
M00003826B:A06 
M00003837D-.A01 
M00003839A:D08 



WO 99/33982 
SEQ ED NO: Cluster 

729 6539 

730 6874 
731 

732 13595 

733 5619 

734 10515 

735 4622 

736 3389 

737 4718 
738 

739 12977 

740 8479 
741 

742 7798 

743 5345 

744 31587 

745 14507 

746 13576 
747 

748 9285 

749 39809 

750 16317 

751 8672 

752 J2532 

753 3900 

754 23255 

755 24488 

756 40122 

757 23210 

758 23358 

759 3430 

760 2433 

761 9105 

762 6124 

763 40073 

764 37285 

765 17036 

766 3706 
767 

768 15069 

769 9285 

770 6880 

771 5325 



ID Sequence Name 

90.B10.sp6:130880.Seq 

90.C10.sp6:130892.Seq 

90.D10.sp6:130904.Seq 

90.E10.sp6:130916.Seq 

90.F10.sp6:130928.Seq 

90.G10.sp6:130940.Seq 

90.H10.sp6:130952.Seq 

90.Al].sp6:130869.Seq 

90.Bll.sp6:130881.Seq 

90.Cll.sp6:130893.Seq 

90.Fll.sp6:130929.Seq 

90.Gll.sp6:130941.Seq 

90.Hll.sp6:130953.Seq 

90.A12.sp6:130870.Seq 

90.C12.sp6:130894.Seq 

90.D12.sp6:130906.Seq 

90.F12.sp6.130930.Seq 

90.G12.sp6:130942.Seq 

90.H12.sp6:130954.Seq 

99.Al.sp6:131230.Seq 

99.Bl.sp6:131242.Seq 

99.Cl.sp6:131254.Seq 

99.Dl.sp6:131266.Seq 

99.El.sp6:131278.Seq 

99.Fl.sp6.131290.Seq 

99.C2.sp6:131255.Seq 

99.D2.sp6:131267.Seq 

99.E2.sp6:131279.Seq 

99.F2.sp6: 13 129 l.Seq 

99.A3.sp6.131232.Seq 

99.B3.sp6:131244.Seq 

99.C3.sp6:131256.Seq 

99.D3.sp6:131268.Seq 

99.E3.sp6:131280.Seq 

99.H3.sp6:131316.Seq 

99.A4.sp6:I31233.Seq 

99.C4.sp6:131257.Seq 

99.D4.sp6:131269.Seq 

99.F4.sp6:131293.Seq 

99.H4.sp6:131317.Seq 

99.A5.sp6.131234.Seq 

99.C5.sp6:131258.Seq 
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PCT/US98/27610 
Clone Name 

M00003844C:B11 

M00003846B:D06 

M00003851B:D08 

M00003851B.D10 

M00003853A:D04 

M00003853A:F12 

M00003856B:C02 

M00003857A.G10 

M00003857A.H03 

M00003867A:D10 

M00003875B:F04 

M00003875C.G07 

M00003875D:D1 1 

M00003876D:E12 

M00003879B:C11 

M00003879B:D10 

M00003879D:A02 

M00003885C:A02 

M00003891C.H09 

M00003906C:E10 

M00003907D:A09 

M00003907D.H04 

M00003909D:C03 

M00003912B.D01 

M00003914C:F05 

M00003922A:E06 

M00003968B:F06 

M00003970C:B09 

M00003974D:E07 

M00003974D:H02 

M00003981A.E10 

M00003982C:C02 

M00003983A:A05 

M00004028D:A06 

M00004028D.-C05 

M00004035C.A07 

M00004035D:B06 

M00004068B:A01 

M00004072A:C03 

M00004081C:D10 

M00004086D:G06 

M00004087D:A01 

M00004093D.B12 



WO 99/33982 
SEQ ID NO: Cluster ID 

772 7221 

773 4937 

774 6874 

775 13183 
776 

777 13272 

778 5257 

779 6455 

780 5319 

781 4908 

782 11494 

783 11443 
784 

785 8210 

786 14311 
787 

788 12971 

789 6455 

790 7212 

791 4905 

792 16914 

793 16921 

794 13046 

795 9457 

796 26295 

797 21847 
798 

799 
800 

801 39304 

802 2428 

803 1058 

804 5097 

805 20212 

806 23255 

807 2790 

808 6420 

809 4059 
810 

81 1 9952 

812 13238 

813 9685 
814 



Sequence Name 

99.D5.sp6:131270.Seq 

99.E5.sp6:131282.Seq 

99.F5.sp6:131294.Seq 

99.G5.sp6:131306.Seq 

99.H5.sp6:131318.Seq 

99.A6.sp6:131235.Seq 

99.B6.sp6:131247.Seq 

99.D6.sp6:131271.Seq 

99.E6.sp6:131283.Seq 

99.F6.sp6:131295.Seq 

99.G6.sp6:131307.Seq 

99.A7.sp6:131236.Seq 

99.B7.sp6:131248.Seq 

99.C7.sp6:131260.Seq 

99.D7.sp6:131272.Seq 

99.E7.sp6:131284.Seq 

99.B8.sp6:131249.Seq 

99.C8.sp6:131261.Seq 

99.D8.sp6:131273.Seq 

99.H8.sp6:131321.Seq 

99.A9.sp6:131238.Seq 

99.D9.sp6:131274.Seq 

99.E9.sp6:131286.Seq 

99.F9.sp6:131298.Seq 

99.G9.sp6:131310.Seq 

99.H9.sp6:131322.Seq 

99.H10.sp6:131323.Seq 

99.Bll.sp6:131252.Seq 

99.Dll.sp6:131276.Seq 

RTA000001 1 8A.j .21.1 .Seq_THC 151859 

RTA00000 1 23A.1.2 1 . 1 .Seq_THC205063 

RTA00000126A.e.20.3.Seq_THC217534 

RTA000001 34A.k. 1 . 1 .Seq_THC2 1 5869 

RTA00000134A.1.22.1 .Seq_THC 128232 

RTA00000177AF.e.l4.3.Seq_THC228776 

RTA00000 1 77 AF .e.2. 1 . Seq_THC22946 1 

RTA00000177AF.f.l0.3.Seq_THC226443 

RTA000001 77AF.n.l 8.3 .Seq_THC 12305 1 

RTA00000 1 79AFJ .13.1 .Seq_THC 1 05 720 

RTA000001 80AF.C.20. 1 .Seq_THC 1 62284 

RTAOOOOO 1 8 1 AF.m.4. 1 .Seq_THC 1 4069 1 

RTAOOOOO 1 83 AF.c. 1 1 . 1 .Seq_THC 1 09544 

RTA000001 83AF.C.24. 1 .Seq_THC125912 
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Clone Name 
M00004105C:A04 
M00004108A.E06 
M00004111D:A08 
M00004114C:F11 
M00004121B:G01 
M00004138B:H02 
M00004146C:C11 
M00004157C:A09 
M00004169C:C12 
M00004171D:B03 
M00004172C:D08 
M00004185C:C03 
M00004191D-.B11 
M00004197D:H01 
M00004203B:C12 
M00004205D:F06 
M00004223D-.E04 
M00004229B:F08 
M00004230B:C07 
M00004269D:D06 
M00004275C:C11 
M00004295D:F12 
M00004296C:H07 
M00004307C:A06 
M00004312A:G03 
M00004318C:D10 
M00004505D:F08 
M00004692A:H08 
M00005180C:G03 



9933982A2 l_> 



WO 99/33982 
SEQIDNO: Cluster ID 



815 


6420 


816 


6974 


817 


40044 


818 




819 


5892 


820 


40044 


821 




822 


19267 


823 


8773 


824 


7570 




1 1476 


826 




827 


17076 


828 


697 


800 








830 


5619 


831 


4718 


832 


39809 


833 


23255 


834 


40122 


835 


23210 


836 


23358 


837 


5693 


838 


2433 


839 


5257 


840 


16392 


841 




842 


26295 


843 




844 





Sequence Name 
RTA00000 1 83 AF.d. 11 . 1 .Seq_THC226443 
RTA00000 1 83AF.d.9. 1 .Seq_THC223 1 29 
RTAOOOOO 1 83 AF.g.22.1 .Seq_THC232899 
RTA00000 1 83 AF.g.9. 1 .Seq_THC 1 98280 
RTA00000 1 84AF.d. 1 1 . 1 .Seq_THC 1 6 1 896 
RTA00000 1 86AF.d. 1 . 1 .Seq_THC232899 
RTA00000 1 86AF.h. 1 4. 1 .Seq_THCl 1 2525 
RTAOOOOO 1 86 AF.1. 12.1 .Seq_THC 1 78 1 83 
RTAOOOOO 1 87AF.f.24. 1 .Seq_THC220002 
RTA000001 87AF.g.24.1 .Seq_THCl 68636 
RTAOOOOO 1 87AF.p. 1 9. 1 .Seq_THC 1 08482 
RTAOOOOO 1 88 AF.d. 11.1 .Seq_THC2 1 2094 
RTAOOOOO 1 88 AF.d.2 1 . 1 .Seq_THC208760 
RTAOOOOO 1 88AF.d.6. 1 .Seq_THCl 78884 
RTAOOOOO I SSAF.g. 11.1 .Seq_THC 1 23222 
RTAOOOOO 1 88AF.1.9.1 .Seq_THC 167845 
RTAOOOOO 1 89AF.g.5. 1 .Seq_THC 1 96 1 02 
RTAOOOOO 1 90AF.e.3 . 1 .Seq_THC 1 502 1 7 
RTAOOOOO 1 90AFJ.4. 1 .Seq_THC228776 
RTAOOOOO 1 90AF.n.23 . 1 .Seq_THC 1 09227 
RTAOOOOO 1 90AF.O.20. 1 .Seq_THC207240 
RTAOOOOO 1 90AF.O.2 1 . 1 .Seq_THC207240 
RTAOOOOO 1 90AF.p. 1 7.2.Seq_THC 1 733 1 8 
RTAOOOOO 1 9 1 AF.a. 1 5.2.Seq_THC79498 
RTAOOOOO 1 92AF.f.3 . 1 .Seq_THC2 1 3833 
RTAOOOOO 1 92AF.1. 1 . 1 .Seq_THC20207 1 
RTAOOOOO 1 93 AF.c.2 1 . 1 .Seq_THC222602 
RTA00000193AF.i.24.2.Seq_THC197345 
RTAOOOOO 1 93 AF.m.5 . 1 .Seq_THC 1 733 1 8 
RTAOOOOO 1 93AF.n. 15.1 .Seq_THC2 1 5687 
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Clone Name 
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Table 2 



[Nearest 
I Neighbor 
|(BlastN vs. 
I Gen bank) 



[Nearest 
I Neighbor 
|(BlastX vs. 
]Non- 

I Redundant 
I Proteins) 



SEQ I ACCESSION | DESCK1F1 ION 
ID 



P | ACCESSION I 
VALUE 



DESCRIPTION 



1 



<NONE> 



<NONE> 



<NONE>| <NONE> 



2 [ <NONE> 

3 I <NONE> 
"4 I <NONE> 



<NONE> 
<NONE> 
<NONE> 



<NONE> 
<NONE> 
<NONE> 



<NONE> 



<NONE> 



<NONE> 
<NONE> 
|BAR3_CHllb 



<NONE> 



VALUE 

<NONE> I 



ICYAA^PODA 
N 



<NONE> 



<NONE> 



<NONE> I VP03_HSVSA 



<NONE> 
<NONE> 
BALBIANI RINU 
PROTEIN 3 
PRECURSOR>PIR2:S 
08167 Balbiani ring 3 
protein - midge 
(Chironomus 
tentans)>GP:CTBR3J 
C;tentans balbiani ring 
3 (BR3) gene 
ADENYLATE" 
CYCLASE (EC 
4.6.1.1) (ATP 
PYROPHOSPHATE- 
LYASE) (ADENYLYL 
CYCLASE)>P1R2:JC4 
747 adenylate cyclase 
(EC 4.6.1.1)- 
Podospora 

anserina>GP:PANADC | 
Y_l Podospora 
anserina adenyl cyclase 
gene, exons 1-4 
PROBABLE 
MEMBRANE 
ANTIGEN 3 
(TEGUMENT 
PROTEIN)>PIR2:C36 
806 hypothetical 
protein ORF3 - 
saimiriine herpesvirus 1 ! 
(strain 

11)>GP:HSGEND_3 
Herpesvirus saimiri 
complete genome 
DNA; ORF 03; 
similarity to ORF 75 
and EBV BNRF1 



0.97 
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J Nearest 
jNeighbor 
KBIastN vs. 
I Gen bank) 



Table 2 



PCT/US98/27610 



slq (accession 

ID 

<NONE> 



<NONE> 



P 

VALUE 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



ACCESSION 



<NONE>| ATFCA2 18 



DESCRIPTION 



8 I <NONE> 



<NONE> 



v <NONE> <NONF> UnoNE> NCU50264 1 



10 I <NONE> 



<NONE> I <NONE> 



Arabidopsis thaiiana 
DNA chromosome 4, 
ESSA I contig 
fragment No; 2; 
Hydroxyproline-rich 
glycoprotein homolog; 
Similarity to 
hydroxyproline-rich 
glycoprotein precursor- 

. , com mon tobacco 

<NUNE>|DHAL_ASPN ALDEHYDE 

DEHYDROGENASE 
(EC 1.2.1.3) 

(ALDDH)>GP:ASNA 
LDAAJ Aspergillus 
niger aldehyde 
dehydrogenase (aldA) 
I gene, complete cds 
Neurospora crassa two- 
component histidine 
kinase (nik-l)gene, 5' 
region and partial cds 



11 I <NONE> 



<NONE> 



JNtuu_BOVI NEUROGRANIN 
N l(P17)(B-50 

IMMUNOREACTIVE 
C-KINASE 
SUBSTRATE) 
(BICKS) 

(FRAGMENT)>PIR2: 
A39034 neurogranin - 

1 i bo vine (fragment) 

snunis> HUMB >S1IN Homo sapiens bystin" 



<NONE> 



<NONE> I <NONE> I 



mRNA, complete cds 



13 I <NONE> 



<NONE> 



a 1 BMP1J | Bos taurus BMP1 gene, 
partial sequence; Bone 
morphogenetic protein 



1* I <NONE> 



|-iNUNfc>|rcCYSPRol l;congolensemRNA 
_1 for (prepro) cysteine 

I proteinase 



<NONE> UnoNH>| P60_LISIV 



PROTEIN P60 
PRECURSOR 
(INVASION- 
ASSOCIATED 
PROTEIN)>GP:LISIA 
PRELB 1 Listeria 



162 



P 

VALUE 



0.93 



0.9 



0.86 



0.82 



0.81 



0.69 



0.56 



0.15 
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PCT/US98/27610 



Table 2 



SEQ 
ID 



I Nearest 
1 Neighbor 
|(BlastN vs. 
iGenbank) 



I ACCESSION 



15 I <NONfc> 



DESCRIPTION 



P 

VALUE 



[Nearest 
Neighbor 
(BiastXvs. 
Non- 

1 Redundant 
[Proteins) 
ACCESSION 



<NONE> 



DESCRIPTION 



16 I <NONE> 



17 I <NONE> 



18 I <NONE> 

19 I <NONE> 



ivanovii extracellular 
protein homologue 
(iap) gene, complete 
cds 



<NONE>|HEX ADE31 HEXON PROTEIN 

(LATE PROTEIN 2) 
(FRAGMENT)>PIR2: 
S37217 hexon protein - 
human adenovirus 31 
(fragment)>GP:HSAT3 
1H_1 H;sapiens 
adenovirus type 3 1 
hexon gene; Hexon 
protein; Internal 
fragment containing 
1 hypervariable regions 



0.15 



<NONE> <NONE>| HSU77493.1 



Human Notch2 mRNA, j 
partial cds; 
Transmembrane 
protein; hN 



0.13 



<NONE> <NONE> C YB_PARTE C Y'l OCHROMfc B 

(EC 

1.10.2.2)>PIR2:S07743 
cytochrome b - 
Paramecium tetraurelia 
mitochondrion 
(SGC6)>GP:M1PAGE 
N_l 9 Paramecium 
aurelia mitochondrial 
complete genome; 
Apocytochrome b (AA 
1-391) 



0.078 



<NONE> 
<NONE> 



<NONE> 1hUMERB27_ Human c-erbtf-^ gene 



<NONE> I DMTRXIIIJ2 



e xon 7; C-erb-2 protein 1 
D;melanogaster DNA 
for trxl and trxll genes; 
Trithorax protein trxl; 
Trithorax; 
putative>GP:DMTTHO 

RAX_2 

D;melanogaster DNA 
for (putative) trithorax 
protein; Predicted 
trithorax protein 



0.054 



0.047 



163 



WO 99/33982 



Table 2 



PCT/US98/27610 



I Nearest 
I Neighbor 
|(BlastN vs. 
I Gen bank) 



SEQ|ACCESSION 
ID 



20 



<NONE> 



21 



<NONE> 



DESCRIPTION 



<NONE> 



P 

VALUE 



I Nearest 
I Neighbor 
[(BlastX vs. 
Non- 

I Redundant 
[Proteins) 



ACCESSION 



<NONE> 



<NONE>| CELB0281 5 



22 



<NONE> 



23 



<NONE> 



DESCRIPTION 



Caenorhabditis elegans 
cosmid B0281; Similar 
to reverse transcriptases 



P 

VALUE 



0.043 



<NONE> <NONE>| A56263 



<NONE> <NONE> 



24 



<NONE> 



<NONE> <NONE>| SI 6323 



<NONE>|MOT*_V1BP I SODIUM-TYPE 1 0 .041 

A FLAGELLAR 

PROTEIN MOTY 
PRECURSOR>GP:VP 
U06949_4 Vibrio 
parahaemolyticus 
BB22 RNase T (rnt) 
gene and flagellar 
motor component 
(motY) gene, complete 
cds 

beta-galactosidase (EC | 07" 
3.2.1.23) isozyme 12- 
Arthrobacter sp. (strain 
B7)>GP:ASU17417_1 
Arthrobacter sp; beta- 
galactosidase gene, 
complete cds 

USAJ'SEAE |GLUTAMATE-1- 
SEMIALDEHYDE 
2,1-AMINOMUTASE 
(EC 5.4.3.8) (GSA) 
(GLUTAMATE- 1 - 
SEMIALDEHYDE 
AMINOTRANSFERA 
SE) (GSA- 

AT)>PIR2:S57898 
glutamate 1- 
semialdehyde 2,1- 
aminomutase - 
Pseudomonas 
aeruginosa>GP:PAHE 
ML_1 P;aeruginosa 
hemL gene; Glutamate 
1-sem 



hypothetical protein - 
Arabidopsis 
thaiiana>GP:ATHB 1_ 1 
A;thaliana homeobox 
geneAthb-1 mRNA; 
Open reading frame 



0.035 
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Table 2 



PCT/US98/27610 



I Nearest 
Neighbor 
(BlastN vs. 

iGenbank) 



[Nearest 
I Neighbor 
|(BlastX vs. 
|Non- 

1 Redundant 
I Proteins) 



I ACCESSION 



DESCRIPTION 



P 

VALUE 



ACCESSION 



<NONE> 



<NONE> |<N0NE>1 IRS1_RAT 



INSULIN RECEPTOR 
SUBSTRATE- 
1>PIR2:S 16948 
hypothetical protein 
IRS-1 - 

rat>GP:RNIRSHRMJ 
R;Norvegicus IRS-1 
mRNA for insulin- 
receptor; During insulin 
stimulation, undergoes 
tyrosine 

phosphorylation and 
binds 

phosphatidylinositol 3- 
kinase 



<NONE> 



<NONE> <NONE> I CEM02G9_2 



DESCRIPTION 



P 

VALUE 



0.027 



Caenorhabditis elegans 
cosmid M02G9; 
M02G9;1; Similar to 
keratin like protein; 
cDNA EST 
yk308gll;5 comes 
from this gene; cDNA 
EST yk208el 1;5 comes 
from this gene; cDNA 
EST yk208el 1;3 comes 



0.0088 



<NONE> 



<NONE> |<NONE>| S75490_3 



competence region: 
iga=IgA protease, 
comA=transformation 
competence [Neisseria 
gonorrhoeae, MS 1 1 , 
Genomic, 3 genes, 
2664 nt] 



<NONE> 



<NONE> I <NONE> 



|EXTN_TOBA 

c 



EXTENSIN 
PRECURSOR (CELL 
WALL 

HYDROXYPROLINE- 
RICH 
GLYCOPROTEIN)>PI 
R2:S06733 
hydroxyproline-rich 
glycoprotein precursor 
- common 

tobacco>GP:NTEXT 1 
Tobacco HRGPnt3 
gene for extensin; 
Extensin (A A 1-620) 
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Nearest 
Neighbor 
(BJastN vs. 
Cenbank) 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



SEQ 
ID 



29 



<NONE> 



30 



<NONE> 



DESCRIPTION 



<NONE> 



<NONE> 



P 

VALUE 



ACCESSION 



<NONE> 



HPCEGS 1 



<NONE> 



HHVBC 4 



DESCRIPTION 



Hepatitis C virus 
complete genome 
sequence; Polyprotein 



32 



<NONE> 



33 



<NONE> 



<NONE> 



34 



<NONE> 



<NONE> 



<NuNb> 



HSLTGFBP4 
1 



Human hepatitis virus 
(genotype C, HMA) 
preSl,preS2, S, C, X, 
antigens, core antigen, 
X protein and 
polymerase 



<NONE> I S74909 



|grn_mous 

E 



<NONE> 



[CA2FMOUS 
E 



<NONE> 



<NONE> |MMMHC29N 
7 2 



166 



Homo sapiens mRNA 
for latent transforming 
growth factor-beta 
binding protein-4; 
Latent TGF-beta 
binding protein-4 
transposase - 
Synechocystis sp. (PCC 
6803)>GP:D90909_10 
8 Synechocystis sp; 
PCC6803 complete 
genome, 1 1/27, 
1311235- 1430418; 
Transposase; 
ORF_ID:slr206 2 
GRANULINS 
PRECURSOR 
(ACROGRANIN)>GP: 
MUSAP_1 Mouse gene | 
for acrogranin 
precursor, complete cds I 



P 

VALUE 



0.0014 



0.00093 



0.00061 



PROCOLLAGEN 
ALPHA 2(1) CHAIN 
PRECURSOR>PIR2:A 
43291 collagen alpha 
2(1) chain precursor - 
mouse>GP:MMCOL 1 
A2J Mouse COL1A2 
mRNA for pro-alpha- 
2(1) collagen 



0.00016 



Mus musculus major | 8.00E-05 
histocompatibility 
locus class III 

region:butyrophilin-like 
protein gene, partial 
cds; Notch4, PBX2, 
RAGE, lysophatidic 
acid acyl transferase- 



BNSDOCID: <WO 9933982A2J. 
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WO 99/33982 

Table 2 



-. r 


Nearest 1 

Neighbor 

BlastN vs. 

^enbank) 1 

1 

] 


Nearest 
Neighbor 
BlastX vs. 
Son- 
Redundant 
Proteins) 




SEQ . 
ID 


\CCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 












alpha, palmitoyl- 




36 


<NONE> 


<NONE> 


<TMONE> 


NFH_RAT 


NEUROFILAMENT 
TRIPLET H PROTEIN 
(200 KD 

NEUROFILAMENT 
PROTEIN) (NF-H) 
(FRAGMENT) 


2.40E-05 


37 


<NONE> 


<NONE> 


<NONE> 


HUMVWFM_ 
1 


Human von Willebrand 
factor mRNA, 3' end; 
Von Willebrand factor 
prepropeptide 


1.70E-05 


38 


<NONE> 


<NONE> 


<NONE> 


CGHU2E 


collagen alpha 2(XI) 
chain - human 
(fragment) 


2.00E-06 


39 


<NONE> 


<NONE> 


<NONE> 


A6U83 


hypothetical protein 
(sdsB region) - 
Pseudomonas sp. 


4.90E-08 


40 


<NONE> 


<NONE> 


<NONE> 


YM8L YEAS 
T 


HYPOTHETICAL 71.1 
KD PROTEIN IN 
DSK2-CAT8 
INTERGENIC 
REGION>PIR2:S5458 
5 hypothetical protein 
YMR278w - yeast 
(Saccharomyces 
cerevisiae)>GP:SC802 
lA 4 DjCerevisiac 
chromosome XIII 
cosmid 8021; 
Unknown; YM8021;04, 
unknown, len: 622, 
CAI:0;16, 


1.50E-09 


41 


<NONE> 


<NONE> 


<NONE> 


MTCY210_31 


Mycobacterium 
tuberculosis cosmid 
Y210; Unknown; 
MTCY210;31, 
unknown, len: 299 aa, 
slight similarity to 
carboxykinases 


3.10E-10 
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Table 2 
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Nearest 
Neighbor 
(BlastN vs. 
Gen bank) 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



SEQ 
ID 



42 



<NONE> 



DESCRIPTION 



<NONE> 



P 

VALUE 



ACCESSION 



<NONE> 



CEC01G10_5 



43 



<NONE> 



DESCRIPTION 



<NONE> 



<NONE>| HSU 15779 J 



Caenorhabditis elegans 
cosmid C01G10, 
complete sequence; 
C01G10;8; CDNA EST 
CEMSC45R comes 
from this 

gene>GP:CEC01G10_ 
5 Caenorhabditis 
elegans cosmid 
COIGiG; C0JGlU;8; 
CDNA EST 
CEMSC45R comes 
from this gene 
Human p70 (ST5) 
mRNA, alternatively 
spliced, complete cds; 
Differentially 
expressed; alternatively 
spliced 



P 

VALUE 



2.30E-12 



45 



46 



U61403 



Z92832 



47 



L36557 



<NONE> 



<NONE> I MTCY2 1 0_3 1 



Dictyostelium 
discoideum PrlA 
(prlA) mRNA, 
partial cds. 
Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 

F31D4; HTGS 

phase 1. 

Oryza sativa 

(clone pRG3) 

repetitive 

element. 



U93472 1 



HSU61262 1 



Mycobacterium 
tuberculosis cosmid 
Y210; Unknown; 
MTCY210;31, 
unknown, Ien: 299 aa, 
slight similarity to 
carboxykinases 

Danio rerio PPARB 

gene, partial cds; 

Nuclear receptor C 

domain 

Danio rerio PPARB 

gene, partial cds; 

Nuclear receptor C 

domain 



Human neogenin 
mRNA, complete cds 



1.70E-17 



0.95 



0.94 



0.89 
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Table 2 





Nearest J 

Neighbor 

[BlastN vs. 

Genbank) 


Nearest 
Neighbor 
[BlastX vs. 
Non- 
Redundant 
Proteins) 




SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


48 


AF005898 

• 


Homo sapiens 

Na,K-ATPase 

:>eta-3 subunit 

pseudogene, 

complete 

sequence. 


1 


LRP1_CHICK 


LOW-DENSITY 
LIPOPROTEIN 
RECEPTOR- 
RELATED PROTEIN 
1 PRECURSOR (LRP) 
(ALPHA-2- 
MACROGLOBULIN 
RECEPTOR) 
(A2MR)>PIR2:A53102 
LDL receptor-related 
protein / alpha-2- 
macroglobulin receptor 
precursor - 

chicken>GP:GGLRPA 
2MR 1 G;gallus 
mRNAfor LRP/alp 


0.85 


49 


U 18795 


Saccharomyces 
cerevisiae 
chromosome V 
cosmids 9669, 
8334, 8199, and 
lambda clone 
1160. 


1 


NKC1 SQUA 
C 


BUMETANIDE- 
SENSITIVE SODIUM- 
(POTASSIUM)- 
CHLORIDE 
COTRANSPORTER 2 
(NA-K-CL 

SYMPORTER)>PIR2: 
A53491 bumetanide- 
sensitive Na-K-Cl 
cotransporter - spiny 
dogfish>GP:SANKCC 
1 1 Saualus acanthi as 
bumetanide-sensitive 
Na-K-Cl cotransport 
protein (NKCC 


0.73 


50 


AC002523 


Homo sapiens; 
HTGS phase 1, 
54 unordered 
pieces. 


1 


BXEN CLOB 
O 


BOTULINUM 
NEUROTOXIN TYPE 
E, NONTOXIC 
COMPONENT>GP:C 
LOENT120J 
C;botulinum gene for 
nontoxic component of 
progenitor toxin, 
complete cds 


0.71 
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I Nearest 
I Neighbor 
|(BIastN vs. 
iGenbank) 



SEQ 
ID 



I ACCESSION 



51 I AC002345 



DESCRIPTION 



*** 

SEQUENCING 
IN PROGRESS 
*** Genomic 
sequence from 
Human 17; 
HTGS phase 1, 
10 unordered 
pieces. 



P 

VALUE 



I Nearest 
j Neighbor 
(BlastX vs. 
|Non- 

[Redundant 
[Proteins) 



(ACCESSION 



1 



P3K2 DICDI 



DESCRIPTION 



PHOSPHATIDYLINO 
SITOL 3-KINASE 2 
(EC 2.7.1.137) (PI3- 
KINASE) (PTDINS-3- 
KINASE) 

(PI3K)>GP:DDU23477 
_1 Dictyostelium 
discoideum 
phosphatidyliriositol- 
4,5-diphosphate 3- 
kinase (PIK2) mRNA, 
complete cds 



P 

VALUE 



0.58 



Human mRNA 
for cripto protein 



155651 



noradrenaline 
transporter - 
bovine>GP:BTU09 1 98 

1 Bos taurus 
noradrenaline 
transporter mRNA, 
complete cds 



0.55 



Caeriorhabditis 
elegans cosmid 
B0416. 



169024 



MHC sex-limited 
protein - mouse 
(fragment)>GP:MUSM 
HC4ADJ Mouse class 
III H2-Slp sex-limited 
protein gene, exons 1,2 
and 3; MHC sex 
limited protein 



0.47 



54 I AB006698 



Arabidopsis 
thaliana genomic 
DNA, 

chromosome 5, 
PI clone: 
MCL19. 



S81293 1 



55 



K03458 



Human 

immunodeficienc 

virus type I, 
isolate Zaire 6, 
vif, tat, rev, env, 
nef genes and 3' 
LTR. 



S13383 



LI {insertion sequence, 
provirus} [human 
papillomavirus type 6b 
HPV6b, KP4, Genomic 
Mutant, 121 nt]; 
Authors note this 
reading frame results 
from a 454 bp deletion 
and resulting 



hydroxyproline-rich 
glycoprotein - sorghum 



0.25 



0.24 
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\ . ;' 


Nearest (Nearest 
Neighbor Neighbor 
[BiastNvs. 1 (BlastXvs. 

Genbank) | Non - 

1 Redundant 

1 Proteins) 




SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 1 ACCESSION 
VALUE 1 


DESCRIFilOIN 


P 

VALUE 


56 


B26794 


T1016TR TAMU 
Arabidopsis 
thaliana genomic 
clone T1016. 


1 IRK34 PORP 
U 


CHLOROPLAST SOS 

RIBOSOMAL 

PROTEIN 

L34>PIR2:S73111 

ribosomal protein L34 - 

red alga (Porphyra 

purpurea) 

chloroplast>GP:PPU38 
804_4 Porphyra 
purpurea chloroplast 
genome, complete 
sequence; SOS 
ribosomal protein L34 


0.021 


57 


Z98950 


Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
jv ii 13, ni vjo 
phase 1. 


1 J D41132 


collagen-related protein 
4 - Hydra 
magnipapillata 
(fragment)>PIR2:S219 
32 mini-collagen - 
Hydra 

sp.>GP:HSNCOL4_l 
HydraN-COL4 
mRNA for mini- 
collagen; No start 
codon 


0.02 


58 


U57057 


Human WD 
protein IR10 
mRNA, complete 
cds. 


1 |DMU15602_1 


Drosophila 
melanogaster (zeste- 
white 4) mRNA, 
complete cds; Similar 
to C; elegans B0464;4 
gene product, Swiss- 
Prot Accession Number 
Q03562 


0.019 


59 


U57057 


Human WD 
protein IR10 
mRNA, complete 
cds. 


1 1 CR2JVIOUSE 


COMPLEMENT 
RECEPTOR TYPE 2 
PRECURSOR (CR2) 
(COMPLEMENT C3D 
RECEPTOR)>PIR2:A4 
3526 complement 
C3d/Epstein-Barr virus 
receptor 2 precursor - 
mouse>GP:MUSCR2A 
A_l Murine 
complement receptor 
type 2 (CR2) mRNA, 
complete cds; 
Complement receptor 
type 


0.0074 
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XT ^ j. 

Nearest 
Neighbor 

• ^uiasti> vs. 

: Genbank) 




Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSIOJN 


\ DESCRIFriON J P 

1 VALUE 


ACCESSIOIN 


f DESCRIPTION 1 P 

1 VALUE 


ou 


B65337 


CIT-HSP- 
2021H21.TF 
CIT-HSPHomo 
sapiens genomic 
clone 2021H21. 


1 


A38096 


perlecan precursor - 1 0.005 1 
human>GP:HUMHSP 
G2B_1 Human heparan 
sulfate proteoglycan 
(HSPG2) mRNA, 
complete cds 


61 


U84722 


Human vascular 
endothelial 
cadherin mRNA, 
complete cds 


1 


HSTAFI113J 


H;sapiens mRNA for 0.00 1 2 
TAFII135;Subunitof 
RNA polymerase II 
transcription factor 1 
TFIID J 


62 


L41493 


/\vian rotavirus 
(strain turkey 1) 
genomic segment 
4 outer capsid 
protein (VP8*) 
gene. 


1 


Y328 MYCP 
N 


HYPOTHETICAL 0.00015 

PROTEIN MG328 

HOMOLOG>PIR2:S73 

693 MG328 homolog 

P01_orfl033 - 

Mycoplasma 

pneumoniae (ATCC 

29342) 

(SGC3)>GP:MPAE000 
035_2 Mycoplasma 
pneumoniae from bases 
442306 to 452472 
(section 35 of 63) of 
the complete genome; 
MG328 homolog, 


63 

_ 


D63139 


Aeromonas sp. 
gene for 
chitinase, 
complete and 
partial cds. 


1 


MTCY16B7_3 

1 
] 
i 


Mycobacterium 6.30E-05 

tuberculosis cosmid 

SCY16B7; Unknown; 

MTCY16B7;03, 

initiation factor, len: 

900, similar at C- 

terminal half to eg 

[F2_BACSUP17889 

nitiation factor if-2 

716 aa), fasta | 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



I Nearest 
I Neighbor 

(BlastX vs. 

Non- 

I Redundant 
1 Proteins) 
ACCESSION 



SEQ 
ID 



ACCESSION 



64 



J04974 



Human alpha-2 
type XI collagen 
mRNA 
(COL11A2). 



DESCRIPTION 



P 

VALUE 



1 



|GDF6_BOVI 
N 



DESCRIPTION 



P 

VALUE 



GROWTH/D1FFEREN 
TIATION FACTOR 
GDF-6 PRECURSOR 
(CARTILAGE- 
DERIVED 
MORPHOGENETIC 
PROTEIN 2) (CDMP- 
2) 

(FRAGMENT)>PIR2: 
B55452 cartilage- 
derived morphogenetic 
protein 2 precursor - 
bovine 

(fragment)>GP:BTU13 
66 1_1 Bos taurus 
cartilage-derived morp 



1.00E-05 



65 I AC002394 



Homo sapiens 
Chromosome 16 
BAC clone 
CIT987-SKA- 
211C6 complete 
genomic 
sequence 
complete 
sequence 



ICELC14F11 
6 



Caenorhabditis elegans I 4.60E-06 
cosmidC14Fll; 
Similar to aspartate 
aminotransferase; 
coded for by C; elegans | 
cDNA CEMSF95FB; 
coded for by C; elegans I 
cDNA yk41e4;3; coded | 
for by C; elegans 



66 I AB002312 



Human mRNA 
for KIAA0314 
gene, partial cds. 



67 I AC003085 



|NAT1_YEAS 
T 



N-TERMINAL 
ACETYLTRANSFER 
ASE1 (EC 2.3.1.88) 
(AMINO-TERMINAL, 
ALPHA- AMINO, 
ACETYLTRANSFER 
ASE 1) 



1.00E-09 



Human BAC 
clone RG094H21 
from 7q21-q22, 
complete 
sequence. 



I DP 19 CAEEL 



DPY-19 
PROTEIN>PIR2:S446 
29 f22b7. 10 protein - 
Caenorhabditis 
elegans>GP:CELF22B 
7_9 C;aenorhabditis 
elegans (Bristol N2) 
cosmid F22B7; 
Putative 



4.20E-11 



68 



X55026 



P.ansenna 
complete 
mitochondrial 
genome. 



[NATTYEAS 
T 



N-TERMINAL 
ACETYLTRANSFER 
ASE 1 (EC 2.3.1.88) 
(AMINO-TERMINAL, 
ALPHA- AMINO, 
ACETYLTRANSFER 
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• * '.' ' ' '• 


Nearest 
Neighbor 
(BlastN vs. 
Genbank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 




SEQ 
ID 


Af CFSSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


72 


U67548 


Methanococcus 
jannaschii from 
bases 9862 19 to 
996377 (section 
90 of 150) of the 
complete 
genome. 


0.99 


YB60 YEAS 
T 


HYPOTHETICAL 16.3 
KD PROTEIN IN 
DUR1,2-NGR1 
INTERGENIC 
REGION>PIR2:S4608 
4 probable membrane 
protein YBR210w- 
yeast (Saccharomyces 
cerevisiae)>GP:SCYB 
R210W_1 S;cerevisiae 
chromosome II reading 
frame ORF YBR210w 


1 


73 


U51645 


Plasmodium 
falciparum 
cytidine 
triphosphate 
synthetase gene, 
complete cds. 




UPQVT? PT 1 
or o V rvrLr_l 


Oil! INOIIIUIC VII UO ^i^iti 

H10)RNAL segment 
encoding RNA 
polymerase (L protein), 
complete cds; Viral 
RNA nolvm erase (L 
protein); 

Putative>GP:HPSVRP 
LA 1 SinNombre 
virus (NMRll)RNA 
L segment encoding 
RNA polymerase (L 
protein), complete cds; 
Vir 


0.99 


74 


Z49889 


Caenorhabditis 
elegans cosmid 
T06H11, 
complete 
sequence. 


0.99 


MUSHDPRO 
B_l 


Mouse alternatively 
spliced HD protein 
mRNA, complete cds 


0.021 


75 


Z69374 


Human DNA 
sequence from 
cosmid L174G8, 
Huntington f s 
Disease Region, 
chromosome 
4pl6.3 contains a 
pair of ESTs. 


0.99 


NCPR YEAS 
T 


NADPH- 

CYTOCHROME P450 
REDUCTASE (EC 
1.6.2.4) (CPR) 


0.017 
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. Neighbor 

/T>1 ■j.Tfc.T 

(tflastiN vs. 
Genbank) 




Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSIOI* 


i DESCRIPTION 


f P 
VALUE 


ACCESSIOIN 


[ DESCRIPTION 


P 

VALUE 


76 


Z35847 


S.cerevisiae 
chromosome II 
reading frame 
ORF YBL086c. 


0.99 


CYPA CAEE 
L 


PEPTIDYL-PROLYL 
CIS-TRANS 

loUMfcKAofc, 10 (EC 
5.2.1.8) (PPIASE) 
(ROTAMASE) 
(CYCLOPHILIN- 
10)>GP:CELB0252_4 
Caenorhabditis elegans 
cosmid B0252; Similar 
to peptidyl -prolyl ch>- 
trans isomerase 

/DDT A CD\ 

(rriAbb) 

(CYCLOPHILIN)>GP: 
CEU34954_1 
Caenorhabditis el 


0.0044 


77 


L35330 


Rattus norvegicus 
glutathione S- 
transferase Yb3 
subunit gene, 
complete cds. 


0.99 




Caenorhabditis elegans 
cosmid R148; Contains 
similarity to drosophila 
DNA-binding protein 

coded for by C; elegans 
cDNAykll8ell;5; 
coded for by C; elegans 
cDNA 


0.0032 


78 


Y00324 


Chicken 

vitellogenin gene 
3' flanking 
region. 


0.99 


A56922 


transcription factor shn 
- fruit fly (Drosophila 
melanogaster) 


0.0023 


79 


M32659 


D.melanogaster 
Shabll protein 
mRNA, complete 
cds. 


0.99 


OMU25146J 


Oncorhynchus mykiss 
recombination 
activating protein 2 
gene, partial cds 


0.0017 


80 


Z69880 


H.sapiens 
SERCA3 gene 
partial). 


0.99 


M84D DRO 
ME 

i 

] 
] 
1 
{ 
F 


MALE SPECIFIC 
SPERM PROTEIN 
MST84DD>PIR2:S257 
75 testis-specific 
protein Mst84Dd - fruit 
fly (Drosophila 
meIanogaster)>GP:DM 
MST84DJ* 
D;melanogaster 
VIst84Da, Mst84Db, 
Wst84Dc and Mst84Dd 
>enes for put; sperm 
>rotein 


0.0011 
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* 


Nearest 
Neighbor 
BlastN vs. 
Genbank) 






Nearest 

Ivlgll UUI 

[BlastX vs. 
Non- 

Redundant 
Proteins) 






SEQ 
ID 


ACCESSION 




p 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


81 


M99166 


Escherichia coli 
Trp repressor 
DinQing proiciii 
(wrbA) gene, 
complete cds. 


0.99 


MTU88962J 


Mycobacterium 
tuberculosis unknown 
protein gene, partial cds 


6.50E-07 


82 


X99257 


R.norvegicus 
mRNA for lamin 
C2. 


0.99 


MIU68729J 


Meloidogyne incognita 
cuticle preprocollagen 
(col-2) mRNA, 
complete cds; Putative 


1.60E-09 


83 


AC002432 


Human BAC 
clone RG317G18 
from 7q31, 
complete 
sequence. 


0.98 


1FMDC 


Foot and mouth disease 
virus type c-s8cl, chain 
C - foot and mouth 
disease virus type c- 
SoCi exprcbbcu in 
hamster kidney cells 


0.14 


84 


Z34799 


Caenorhabditis 
elegans cosmid 
F34D10, 
complete 
sequence. 


0.98 


MMU57368_1 


Mus musculus EGF 
repeat transmembrane 
protein mRNA, 
complete cds; Notch 
like repeats; notch 2 


0.0028 


85 


B 15207 


344E15.TV 
CIT978SKA1 
Homo sapiens 
genomic clone A- 
344E15. 


0.98 


POLG HCVJ 
6 


GENOME 
POLYPROTEIN 
(CONTAINS: CAPSID 
PROTEIN C (CORE 
PROTEIN); MATRIX 
PROTEIN 
(ENVELOPE 
PROTEIN M); MAJOR 
ENVELOPE 
PROTEIN E; 
NONSTRUCTURAL 
PROTEINS NS1,NS2, 
NS4A AND NS4B; 
HELICASE(NS3); 
RNA-DIRECTED 
RNA POLYMERASE 
(EC 2.7.7.48) 
(NS5))>PI 


0.00083 
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87 



X57010 



M83093 



Human COL2A1 
gene for collagen 
II alpha 1 chain, 
exons E2-E1S. 
Neurospora 
crassa cAMP- 
dependent protein 
kinase (cot-1) 
gene, complete 
cds. 



89 



U96271 



0.98 



D80005 1 



: : :V 1 Neighbor 

ift-^. : - : j( BIastN vs - 

Genbank) 




Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ (ACCESSION 
ID 1 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


86 | AC0024I2 


*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome X; 
HTGS phase 1,2 
unordered pieces. 


0.98 | 


KDG1 ARAT 
H 


DIACYLGLYCEROL 
KINASE 1 (EC 
2.7.1.107) 
(DIGLYCERIDE 
KINASE) (DGK 1) 
(DAG KINASE 
1)>PIR2:S71467 
diacylglycerol kinase 
(EC 2.7.1.107) 


0.00024 



0.98 |YA53_SCHP 
O 



Arabidopsis 
thaliana>GP:ATHATD 
GK1_1 Arabidopsis 
thaliana mRNA for 
diacylglycerol kinase, 
complete c 



Human mRNA for 
KIAA0183 gene, 
partial cds 



Helicobacter 
pylori heat shock 
protein 70 
(hsp70) gene, 
complete cds. 
Caenorhabditis 
elegans cosmid 
C39E6. 



0.97 I SLMEN6 1 



HYPOTHETICAL 24.2 
KD PROTEIN 
C13A11.03IN 
CHROMOSOME 
I>GP:SPAC13A11_3 
S;pombe chromosome I 
cosmid C13A11; 
Unknown; 
SPAC13A11;03, 
unknown, len:210 



S;latifolia mRNA for 
Men-6 

protein>GP:SLMEN6_ 
1 S;latifolia mRNA for 
Men-6 protein 



5.90E-10 



3.00E-22 



0.43 



0.97 |RON_HUMA 
N 



MACROPHAGE 
STIMULATING 
PROTEIN RECEPTOR 
PRECURSOR (EC 
2.7.1.1 12)>P1R2:I3818 
5 protein-tyrosine 
kinase (EC 2.7.1.112), 
receptor type ron - 
human>GP:HSRONJ 
H;sapiens RON mRNA 
for tyrosine kinase; 
Putative 



0.034 
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iGenbank) 
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Table 2 



I Nearest 
I Neighbor 
|(BlastX vs. 
|Non- 

I Redundant 
I Proteins) 



SEQ |ACCESS1UN 
ID 



91 



Y09255 



92 I AC00241J 



DESCRIPTION 



93 



U41625 



94 



Z82202 



B.cereus dnal 
gene, partial. 



*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome X; 
HTGS phase 1,2 
unordered pieces. 



Caenorhabditis 
elegans cosmid 
K03A1. 



P 

VALUE 



ACCESSION I DESCRIPTION 



P 

VALUE 



Human DN A 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
34P24;HTGS 
phase 1 . 



"097 ICELT05C1_5 



"0^6 I CELC44E4_5 



0.96 



|hmgc_hum 

AN 



0.96 



|YTH3_CAEE 
L 



Caenorhabditis elegans 
cosmid T05C1; Coded 
for by C; elegans 
cDNA yk30f6;3; coded 
for by C; elegans 
cDNA yk34f!0;3 
Caenorhabditis elegans 
cosmid C44E4; Weak 

similarity to the 

drosophila hyperplastic 

disc protein 

(GB:L 14644); coded 

for by C; elegans 

cDNA yk49h6;5; coded 

for by C; elegans 

cDNA 

HIGH MOBILITY 

GROUP PROTEIN 

HMGI- 

OPIR2JC2232 high 
mobility group I-C 
phosphoprotein - 
human>GP:HSHMGIC 
G5_l Human high- 
mobility group 
phosphoprotein isoform 
I-C (HMG1C) gene, 
exon 

5>GP:HSHMGICP_1 
H;sapiens mRNA for 
HMG1-C 

protein>GP:HSHMGIC 
HYPOTHETICAL 75.5 
KD PROTEIN 
C14A4.3 IN 
CHROMOSOME 
II>GP:CEC14A4_3 
Caenorhabditis elegans 
cosmid C14A4, 
complete sequence; 
C14A4;3; Weak 
similarity with a B; 
Flavum translocation 
protein (Swiss Prot 
accession number 
P38376) 



0.00043 



0.73 
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(GRK5) mRNA, 
complete cds. 



X97384 



A.thaliana atran3 
gene. 

Human C5a 
anaphylatoxin 
receptor mRNA, 
complete cds. 



0.95 



<NONE> 



Homo sapiens (clones: 
CW52-2, CW27-6, 
CW15-2,CW26-5, 11- 
67) collagen type VII 
intergenic region and 
(COL7Al)gene, 
complete cds 



<NONE> 



99 



D28778 



Cucumber mosaic | 
J virus RNA 1 for 

la, complete 
I sequence. 



^95 |kjpb_brydi Iribosome- 

INACTIVATING 
PROTEIN BRYODIN 
(RRNA N- 

GLYCOSIDASE)(EC 
3.2.2.22) 

CFRAGMENT)>PIR2: 
SI 6491 rRNAN- 
glycosidase (EC 
3.2.2.22) bryodin-red 
bryony (fragment) 



<NONE> . 



0.83 



|POLS_RUBV 
M 



iUU I AFO 16202 



Homo sapiens 
immunoglobulin 
heavy chain 
CDR3 gene, 



"0^3 I HSU79716 1 



180 



STRUCTURAL 
POLYPROTEIN 
(CONTAINS: 
NUCLEOCAPSID 
PROTEIN C; 
MEMBRANE 

GLYCOPROTEINS El 
AND 

E2)>PIR1:GNWVR3 
structural polyprotein - 
rubella virus (strain 
M33)>GP:TORUB24S 
_I Rubella virus 24S 
subgenomic mRNA for j 
structural proteins El, 

E2 and C; 

Human reelin (RELN) 
mRNA, complete cds 



0.00037 
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I Nearest 
1 Neighbor 
|(BlastN vs. 
I Gen bank) 



I Nearest 
I Neighbor 
KBlastX vs. 
|Non- 

I Redundant 
1 Proteins) 



SEQ | ACCESSION 
ID 



DESCRIPTION 



partial cds. 



P 

VALUE 



I ACCESSION 



P 

VALUE 



101 



Z68303 



Caenorhabditis 
legans cosmid 
ZK809, complete 
sequence. 



0.93 



|HS5HT4SAR_ 
1 



H;sapiens mRNA for 
serotonin 4SA receptor 
(5-HT4SA-R) 



102 



X03049 



E. coli DNA 
sequene 5' to 
origin of 
replication oriC. 



0.93 



S37594 



mucin - human 
(fragment) 



103 



M32659 



D.melanogaster 
Shabl 1 protein 
mRNA, complete 
cds. 



0.93 



S38480 



nonstructural protein - 
rubella 

virus>GP:RVM33NP_ 
1 Rubella virus M33 
RNA for a 

nonstructural protein; 
Nonstructural protein 
genes 



2.30E-06 



104 



D88687 



Human mRNA 

forKM-102- 

derived 

reductase-like 

factor, complete 

cds. 



0.93 



|BAT3_HUMA 
N 



LARGE PROLINE- 
RICH PROTEIN BAT3 
(HLA-B- 
ASSOCIATED 
TRANSCRIPT 
3)>PIR2:A35098 MHC 
class III 

histocompatibility 
antigen HLA-B- 
associated transcript 3 - 
human>GP:HUMBAT 
3A_1 Human HLA-B- 
associated transcript 3 
(BAT3) mRNA, 
complete 

cds>GP:HUMBAT3 



105 



D16847 



Mouse mRNA for 
stromal cell 
derived protein- 1, 
complete cds. 



0.93 



S52796 



prpL2 protein - human 
(fragment)>GP:HSPRP 
L2_l H;sapiens mRNA 
for PRPL-2 protein 
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:':£v: 



SEQ 
ID 



106 



Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



ACCESSION 



D90915 



107 AJ001101 



X57108 



DESCRIPTION 



Synechocystis sp 

PCC6803 

complete 

genome, 17/27, 

2137259- 

2267259. 



Mus musculus 
mRNA for 
gClqBPgene. 



[Nearest 
Neighbor 
(BlastXvs. 
Non- 
Redundant 
Proteins) 

p (accession 

VALUE 



DESCRIPTION 



P 

VALUE 



"0^92 IdMU58282J 



Human gene for 
cerebroside 
sulfate activator 
protein, exons 10- 
14. 



0.92 | Y*Ky_YEAS I HYPOTHETICAL 53.9 1 5.90E-05" 
T KD PROTEIN IN 

AFG3-SEB2 
INTERGENIC 
REGION>PIR2:S5047 
7 hypothetical protein 
YER019w- yeast 
I (Saccharomyces 
cerevisiae)>GP:SCE95 
37_20 Saccharomyces 
cerevisiae chromosome 
Vcosmids 9537, 9581, 
9495, 9867, and 
lambda clon e 5898 
Drosophila 
melanogaster Bowel 
(bowl) mRNA, 
complete cds; 
Transcription factor; 
C2H2 zinc finger 
protein; zinc fingers 
have extensive 
sequence similarity to 
Drosophila odd- 

skipped 

hypothetical protein 
YPR144c- yeast 
(Saccharomyces 
cerevisiae)>GP:YSCP9 
659_1 7 Saccharomyces 
cerevisiae chromosome 
XVI cosmid 9659; 
Yprl44cp; Weak 
similarity near C- 
terminus to RNA 
Polymerase beta 
subunit (Swiss Prot; 
accession number 
P11213) 



0.92 



S69032 
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Table 2 



PO7US98/27610 



I Nearest 
I Neighbor 
KBlastN vs. 
I Gen bank) 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



SEQ (ACCESSION 
ID 



D14635 



Caenorhabditis 
elegans DNA for 
EMB-5. 



DESCRIPTION 



P 

VALUE 



ACCESSION 



0.91 



YM13YEAS 
T 



PUTATIVE ATP- 
DEPENDENTRNA 
HELICASE 
YMR128W>PIR2:S53 
058 probable 
membrane protein 
YMR128w- yeast 
(Saccharomyces 
cerevisiae)>GP:SC955 

4 S;cerevisiae 
chromosome XIII 
cosmid 9553; 
Unknown; YM9553;04, 
probable ATP- 
dependent RNA 
helicase, len: 



DESCRIPTION 



P 

VALUE 



0.69 



B55500 



C1T-HSP- 
387J2.TFB CIT- 
HSP Homo 
sapiens genomic 
clone 3 87 J2. 



0.91 



U97553 79 



Murine herpesvirus 68 
strain WUMS, 
complete genome; 
Unknown 



0.00016 



X03049 



E. coli DNA 
sequene 5* to 
origin of 
replication oriC. 



0.9 



POL MLVAV 



POL POLYPROTEIN 
(PROTEASE (EC 
3.4.23.-); REVERSE 
TRANSCRIPTASE 
(EC 2.7.7.49); 
RIBONUCLEASE H 
(EC 

3.1.26.4))>PIR1:GNM 
VG V pol polyprotein - 
AKV murine leukemia 
virus 



0.0019 



U91327 



Human 
chromosome 
12pl5 BAC clone 
CIT987SK-99D8 
complete 
sequence. 



0.89 



JC5568 



serine protease (EC 
3.4.-.-) hi -Serratia 
marcescens 



X13295 



Rat mRNA for 
alpha-2u 
globulin-related 
protein. 



0.89 



MNGPOLY 1 



Mengo virus 
polyprotein genome, 
complete cds withe 
repeats 



Z78415 



Caenorhabditis 
elegans cosmid 
C17G1, complete 
sequence. 



0.89 



AB000121 



Mouse mRNA for 
TBPIP, complete cds; 
TBP1 interacting 
protein 



0.39 
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Table 2 



PCT7US98/27610 



;■ 


rNearesi 
: : Neighbor 
r (BlastN vs. 

• Gen bank) 

[' 

■•1 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


S£Q 
ID 


ALL JfcSMUIN | DESCRIPTION 


P 

VALUE 


ACCESSKW 


i DESCRIPTION 


P 

VALUE 


115 


AC002308 1*** 

SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 
22qll BAC 
Clone 1000e4; 
HTGS phase 1, 
26 unordered 
1 nieces. 


0.88 


YLK2 CAEE 
L 


HYPOTHETICAL 
122.7 KD PROTEIN 
D1044 1 IN 

CHROMOSOME 
III>GP:CELD1044_4 
Caenorhabditis elegans 
cosmid D1044 


0.0037 


116 


AC002073 I Human PAC 
clone DJ515N1 
from 22ql 1.2- 
q22, complete 
sequence. 


0.88 


S28499 


probable finger protein 
- rat>GP:RNZFP_l 
R;norvegicus mRNA 
for putative zinc finger 
protein 


I.10E-31 


117 


283848 Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57A13;HTGS 
phase 1. 


0.87 


E 


ofcKlNb PROTEASE 
NUDEL PRECURSOR 
(EC 3.4.21.- 
)>PIR2:A57096 nudel 
protein precursor - fruit 
fly (Drosophila 
me!anogaster)>GP:DM 
U29I53_1 Drosophila 
melanogaster nudel 
(ndl) mRNA, complete 
cds; Serine protease; 

Sftm?) HpnpnHAnt nnna 

required matern 


1 


118 


U23449 Caenorhabditis 
elegans cosmid 
K06A1. 


0.87 


AF023268_3 

1 
( 


Homo sapiens clk2 
kinase (CLK2), 
propinl, cotel, 
glucocerebrosidase 
(GBA), and metaxin 
genes, complete cds; 
metaxin pseudogene 
and glucocerebrosidase 
pseudogene; and 
thrombospondin3 
THBS3) 


0.21 


119 


Z68181 H.vulgaris 
mRNA for 
elongation factor 
|eF1 -alpha. 


0.87 I 


4ABCY450C 1 
_1 <■ 
i 


labbit cytochrome P- 
150 gene, clone pP- 
0PBc3, 3* end 


0.14 
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[Nearest 
{Neighbor 
|(BlastN vs. 
iGenbank) 



PCT/US98/27610 



Table 2 



[Nearest 
(Neighbor 
|(BlastXvs. 
|Non- 

1 Redundant 
I Proteins) 



SEQ {ACCESSION 
ID 



DESCRIPTION 



120 I ACOOOU33 



Homo sapiens 
chromosome 9, 
complete 
sequence. 



121 



U23449 



122 



123 



Z89651 



Z94055 



P 

VALUE 



[ACCESSION 



0.87 



|VWF_CANF 
A 



Caenorhabditis 
elegans cosmid 
K06A1. 



DESCRIPTION 



P 

VALUE 



VON W1LLEBRAND 
FACTOR 

PRECURSOR>GP:DO 
GVWG_1 Canis 
familiaris von 
Willebrand factor 
mRNA, complete cds 



0.036 



0.86 | S48988J f CRP-1 =cystatin-related 
protein [rats, Wistar 
albino, mRNA Partial, 
213 nt]; Cystatin- 
related protein; 
Method: conceptual 
translation supplied by 
author; This sequence 
comes from Fig; 



F.rubripes GSS 
sequence, clone 
090I24cD5. 



Human DNA 
sequence from 
PAC 24M15on 
chromosome 1 . 
Contains 
tenascin-R 
(restrictin), EST. 



~Q$6 I CPU65981_1 



0.86 



|gltb_syny 

3 



124 



Z49250 



Human DNA 
sequence from 
cosmid HW2, 
Huntington's 
Disease Region, 
chromosome 
4pl6.3. 



0.86 



|TRSCAPS1D_ 
1 



Cryptosporidium 

parvum P-ATPase gene | 

(CppA-El)gene, 

complete cds; Putative 

calcium-ATPase 

FERREDOXIN- 

DEPENDENT 

GLUTAMATE 

SYNTHASE 1 (EC 
1.4.7.1) (FD- 
GOGAT)>PIR2:S6022 
8 glutamate synthase 
(ferredoxin) (EC 
1.4.7.1) gltB- 
Synechocystis sp. (PCC 
6803)>GP:D90902_66 
Synechocystis sp; 
PCC6803 complete 
genome, 4/27, 402290- 

524345; Gluta 

Tobacco ringspot virus 
capsid protein gene, 
complete cds 



0.64 



0.6 



0.03 
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PCT/US98/27610 



SEQ 
ID 



Nearest 
Neighbor 
(BlastN vs 
Gcnbank) 



ACCESSION 



125 



Z92855 




Caenorhabditis 
elegans DNA *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
Y48C3;HTGS 
phase 1. 



126 | AC002340 I*** 

I SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana TAMU* 
BAC TllJT 
genomic 
sequence near 
marker ^283'; 
HTGS phase 1,2 
unordered piec es 

127 | AL008716 Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
206C7; HTGS 

[phase 1. 

128 J AC002340 *** 

SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana 'TAMU 1 
BAC TllJT 
genomic 
sequence near 
marker , m283 t ; 
HTGS phase 1,2 
I unordered pieces. 



0.84 I AE000809_8 



Methanobacterium 
thermoautotrophicum 
from bases 161632 to 
172569 (section 15 of 
148) of the complete 
genome; Aspartyl-. 
tRNA synthetase; 
Function Code:10;07- 
Metabolism of 



0^83 IHIVU51189 5 



0.83 



S60257 



Caenorhabditis elegans 
cosmid T01E8, 
complete sequence; 
T01E8;3; Similar to 1- 
phosphatidylinositol- 
4,5-bisphosphate 
phosphodiesterase; 
cDNA EST CEESG02F 
comes from this gene; 



Hiv-l clone 93th253 
from Thailand, 
complete genome; Tat 
protein 



meltrin alpha 
mouse>GP:MUSMAB 

1 Mouse mRNA for 
meltrin alpha, complete 
cds 



0.86 



0.86 



0.0013 
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Table 2 



PCT/US98/27610 



[Nearest 
[Neighbor 
|(BlastN vs. 
iGenbank) 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 
ACCESSION 



*SEQ 
ID 



I ACCESSION I DESCRIPTION 



129 



Z83848 



j Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57A13;HTGS 
phase 1 . 



P 

VALUE 



0.82 



|ARO!_PNEC 
A 



PENT AFUNCTIONAL 
AROM 

POLYPEPTIDE 
(CONTAINS: 3- 
DEHYDROQUINATE 
SYNTHASE (EC 
4.6.1.3), 3- 

DEHYDROQUINATE 
DEHYDRATASE (EC 

4.2.1.10) (3- 
DEHYDROQUINASE) 
, SHIKIMATE 5- 
DEHYDROGENASE 

(EC 1.1.1.25), 
SHIKIMATE KINASE 
(EC 2.7.1.71), AND 
EPSP SYNTHASE (E 



130 I AF029308 



I Homo sapiens 
chromosome 9 
duplication of the 
Tcell receptor 
beta locus and 
trypsinogen gene 
families. 



0.8 



CELZK84 5 



Caenorhabditis elegans 
cosmid ZK84; Final 
exon in repeat region; 
similar to long tandem 
repeat region of 
sialidase 

(SP:TCNA_TRYCR, 
P23253) and 
neurofilament H 
protein; coded for by C; 
elegans 



P 

VALUE 



0.0098 



2.00E-08 



131 I AC002458 



j Human BAC 
clone RG098M04 
from 7q21-q22, 
complete 
sequence. 



0.78 



IGF2 PIG 



INSULIN-LIKE 
GROWTH FACTOR II 
PRECURSOR (IGF- 
II)>GP:SSIGF2_1 
S;scrofamRNA IGF2 
for insulin-like-growth 
factor 2; Insulin- like- 
growth factor 2 
preproprotein 



132 



Z83843 



| Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
368A4; HTGS 
phase 1. 



0.78 



PARS 1 A 1 



P;tetraurelia51A 
surface protein gene, 
complete cds 
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"SEQ 
ID 



133 



Nearest 
Neighbor 
(BlastN vs. 
Gen bank) 



ACCESSION 



XO302J 



DESCRIPTION 



Human gene for 
granulocyte- 
macrophage 
colony 

stimulating factor 
(GM-CSF). 



S.cerevisiae 
chromosome XV 
reading frame 
ORF YOL083w, 



P 

VALUE 



0.78 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



ACCESSION 



CEF57B1 3 



DESCRIPTION 



274825 



136 



137 



U80445 



Z78545 



S.cerevisiae 
chromosome XV 
reading frame 
ORF YOL083w. 



Caenorhabditis 
elegans cosmid 
C50F2. 



Caenorhabditis 
elegans cosmid 
M03B6, complete 
sequence. 



0.77 



Caenorhabditis elegans 
cosmid F57B1, 
complete sequence; 
F57B1;3; Protein 
predicted using 
Genefinder; similar to 
collagen 

PUTATIVE LEUCYL 
TRNA 

SYNTHETASE, 
MITOCHONDRIAL 
PRECURSOR (EC 
6. L 1 .4) (LEUCINE— 
TRNA 

LIGASE)>PIR2:S6248 
6 hypothetical protein 
SPAC4G8.09 - fission 
yeast 

(Schizosaccharomyces 
pombe)>GP:SPAC4G8 
__9 S;pombe 
chromosome I cosmid 
c4G8; Unknown; 
SPAC 



P 

VALUE 



2.20E-05 



0.96 



0.76 



[KNU59809J Rattus norvegicus 

mannose 6- 
phosphate/insulin-like 
growth factor II 
receptor (M6P/IGF2r) 
mRNA, complete cds; 
Also termed IGF- 
II/Man 6-P receptor, 
JMPR, CI-MPR 



0.01 



S28499 



"0?75 I RRU73586J 



I probable tinger protein 

rat>GP:RNZFPJ 
I R;norvegicus mRNA 
for putative zinc finger 

protein 

I Rattus norvegicus 
Fanconi anemia group 
C mRNA, complete 
cds; Fanconi anemia 
group C protein; 
Similar to human FAC 
protein, GenBank 
Accession Numbers 
(X66893 and X66894 



1.10E-31 



0.023 
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PCTAJS98/27610 



[Nearest 
I Neighbor 
l(BlastN vs. 
1 Gen bank) 



[Nearest 
I Neighbor 
|(BlastX vs. 
|Non- 

1 Redundant 
1 Proteins) 



SEQ |ACCESSION 
ID 



138 



Z97630 



139 I AF007269 



140 AC002393 



T4T1 B 15232 



T42~1 D13748 



DESCRIPTION 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
466N1;HTGS 
phase 1. 
Arabidopsis 
thaliana BAC 
IG002N01. 



P 

VALUE 



ACCESSION I DESCRIPTION 



0.74 



IHSMSHREC 
A 1 



~0T\ I HSU95090J 



Mouse 
BAC284H12 
Chromosome 6, 
complete 
sequence. 
344G8.TV 
CIT978SKA1 
Homo sapiens 
genomic clone A- 
344G08. 
Human mRNA 
for eukaryotic 
initiation factor 
4AI. 



T43| S45791 



band 3-related 
protein=renal 
anion exchanger 
AE2 homolog 
[rabbits, New 
Zealand White, 
ileal epithelial 
cells, mRNA, 
3964 nt]. 



144 I M22462 



Chicken protein 
p54 (ets-1) 



I H;sapiens mRNA for 
MSH receptor; Author- 
given protein sequence 
is in conflict with the 
conceptual translation 



Homo sapiens 
chromosome 19 cosmid 
F 19541, complete 
sequence; F19541__l; 
Hypothetical (partial) 
protein similar to 
1 proline oxidase 



1T7 I RNLTBP2_1 



T67 I DMSEVL2_2 



(Rattus norvegicus 
mRNA for LTBP-2 like 
protein; Latent TGF- 
beta binding protein-2 
like protein 
Drosophila 
melanogaster sevenless 
mRNA; Put; sevenless 
protein (AA 1 - 2510) 



VALUE 



0.036 



0.16 



4.40E-05 



0.41 



"066 IMMU53563J Mus musculus Brgl 

mRNA, partial cds; N- 
terminal region of the 
I protein 



0.66 



IPOLSJRUBV 
R 



0.66 



IHSHP8PR01 
1 



structural 
| polyprotein 
^contains: 
|nucleocapsid 
i protein c; 
membrane 

GLYCOPROTEINS El 
AND 

|E2)>PIR1:GNWVRA 
structural polyprotein 

| rubella virus (strain 
RA27/3 

vaccine)>GP:RUBCE2 
_1 Rubella virus 
, RA27/3 RN A for 
jcapsid, E2 and El 

proteins; Poly 

I H;sapiens mRNA for 
HP8 protein; HP8 



5.60E-05 
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ID 



145 



Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



ACCESSION 



U27999 



DESCRIPTION 



P 

VALUE 



mRNA, complete 
cds. 

Human clone 
PDEL52A11 
HLA-C region 
cosmid 52 
genomic survey 
sequence. 



0.65 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



ACCESSION 



CA18_HUMA 
N 



DESCRIPTION 



peptide 



146 I M54787 



I N.crassa mating 
type a- 1 protein 
(mt a-1) gene, 
exons 1-3. 



COLLAGEN ALPHA 
l(VIII) CHAIN 
PRECURSOR 
(ENDOTHELIAL 
COLLAGEN)>PIR2:S 
15435 collagen alpha 
l(VIII) chain precursor 

human>GP:HSCOL8A 
1_1 Human COL8A1 
mRNA for alpha 
l(VIII) collagen 



P 

VALUE 



5.70E-06 



147 I AC002094 



148 I U32701 



Genomic " 
sequence from 
Human 17, 
complete 
[sequence. 



0M 1 K0717 I vacuolar H+-ATPase A 
subunit - chicken 
(fragment)>GP:GGU22 
078 J Callus gallus 
vacuolar H+-ATPase A 
subunit gene, partial 
cds 



0.0046 



0.63 



fVPVAlJ P;vivaxpvalgene 



0.1 



Haemophilus 
influenzae from 
bases 165345 to 
176101 (section 
16 of 163) of the 
complete 
genome. 



0.63 



|FABG_HAEJ 
N 



149 



Z37159 



T.brucei serum 
resistance 
associated (SRA) 
mRNA for VSG- 
I like protein. 



0.61 



<NONE> 



J-UXOACYL-IACYL- 1 2.00E-12 
CARRIER PROTEIN] 
REDUCTASE (EC 
LI. 1.100) (3- 
KETOACYL- ACYL 
CARRIER PROTEIN 
REDUCTASE)>PIR2: 
D64051 3-oxoacyl- 
[acyl-carrier-protein] 
reductase (EC 
1.1.1.100)- 
Haemophilus 
influenzae (strain Rd 
KW20)>GP:HIU3270I 
_7 Haemophilus 
<NONE> 
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PCT/US98/27610 



[Nearest 
1 Neighbor 
(BlastN vs. 
I Gen bank) 



SEQ [ACCESSION 
ID 



DESCRIPTION 



150 I AF027865 



151 



U40938 



152 



153 



116670 



Z84468 



I Mus musculus 
Major 

Histocompatibilit 
y Locus class II 

I region. 

Caenorhabditis 
elegans cosmid 
D1009. 



154 



U00054 



155 I M21207 



I Sequence 1 from 
patent US 
5476781. 

Human DN A 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
299D3;HTGS 
I phase 1 . 



Caenorhabditis 
elegans cosmid 
K07E12. 



P 

VALUE 



Nearest 
1 Neighbor 

(BlastX vs. 
jNon- 

I Redundant 
1 Proteins) 
ACCESSION 



DESCRIPTION 



P 

VALUE 



0.61 



0.61 



A56514 



|YA53_SCHP 
O 



"039 I CELF21F8J7 



0.59 



|CLG1_YEAS 
T 



0.57 



<NONE> 



Synthetic SV4U i | 
antigen mutant 
pseudogene, 3 1 
end. 



0.57 



1CJL2 cathepsin L (EC 
3.4.22.15) mutant 
(F(78P)L, C25S, 
T110A,E176G, 
D178G), fragment 2 
human 



0.39 



I chromokinesin - I 0045 

chicken>GP:GGU1830 
9_1 Gallus gallus 
chromokinesin mRNA, 

I complete cds 

HYPOTHETICAL 24.2 1 1.90E-24 
KD PROTEIN 
C13A11.03IN 
CHROMOSOME 
I>GP:SPAC13A11_3 
S;pombe chromosome I 
cosmid cl3All; 
Unknown; 
SPAC13A11;03, 
unknown, len: 2 1 0 
Caenorhabditis elegans 
cosmid F21F8; Similar 
to eukaryotic aspartyl 
proteases 

CYCLIN-L1KE I 0.0015 

PROTEIN 

CLG1>PIR2:S37607 
cyclin-like protein 
YGL215w- yeast 
(Saccharomyces 
cerevisiae)>GP:SCYG 
L215W_1 S;cerevisiae 
chromosome VII 
reading frame ORF 
YGL215w>GP:YSCC 

LG1CPR_1 
Saccharomyces 
cerevisiae cyclin-like 
protein (CLGl)gene 
<NONE> 



0.43 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



Table 2 



PO7US98/27610 



5EQ 
ID 



156 



ACCESSION) DESCRIPTION 



AF020282 



Dictyostelium 
discoideum 
DG2033 gene, 
partial cds. 



P 

VALUE 



I Nearest 
I Neighbor 
|(BlastX vs. 
|Non 

I Redundant 
I Proteins) 
ACCESSION 



0.56 I AC002125 4 



"description" 



157 



M86352 



Stigmatella 
aurantiaca reverse | 
transcriptase (163 
RT) gene, 
complete cds. 



0.56 



Homo sapiens DNA 
from chromosome 19- 
cosmid F25965, 
genomic sequence, 
complete sequence; 
F25965_5; 

Hypothetical 35;3 kDa 
protein similar to 
GTPase-activating 
proteins and orf3 from 



P 

VALUE 



0.6 



158 I AC00310I *** 



159 



B12117 



SEQUENCING 
IN PROGRESS 
***Homo 
sapiens 

chromosome 17, 
clone 

HRPC41C23; 
HTGS phase 1, 
33 unordered 

pieces. 

F5L15-T7 IGF 
Arabidopsis 
thaliana genomic 
clone F5L15. 



0.54 



AC002398_4 Human DNA from 
chromosome 19- 
specific cosmid 
F25965, genomic 
sequence, complete 
sequence; F25965_3; 
Hypothetical 96 kDa 
human protein similar 
to alpha chimaerin 
Hypothetical 
protein>GP:AC002398 
|_4 Human DNA from 
chromosome 19- 
specific cosmi 



4.50E-06 



<NONE> 



<NONE> 



<NONE> 



"034 I CEF32H2_5 



Caenorhabditis elegans 
cosmid F32H2, 
complete sequence; 
F32H2;5; Similarity to 
Chicken fatty acid 
synthase (SW:P12276); 
cDNAESTykI6c2;5 
comes from this gene; 

cDNAESTyk!13h6;5 
comes 



192 



BNSDOCID: <WO_9933982A2J_> 



WO 99/33982 



Table 2 



PCT/US98/27610 



SEQ 
ID 



I Nearest 
{Neighbor 
|(BlastN vs. 
IGenbank) 



ACCESSION 



160 I AE000664 



161 



B12117 



DESCRIPTION 



P 

VALUE 



I Nearest 
I Neighbor 

(BlastXvs. 

Non- 

1 Redundant 
1 Proteins) 
ACCESSION I 



DESCRIPTION 



Mus musculus 
TCR beta locus 
from bases 
250554 to 501917 1 
(section 2 of 3) of | 
the complete 
sequence. 



"034 I CET01G9_6 



ICaenorhabditis elegans 
cosmid T01G9, 
complete sequence; 
T01G9;4; CDN A EST 
yk29b7;5 comes from 
this gene 



F5L15-T7 IGF 
Arabidopsis 
thaliana genomic 
clone F5L15. 



162 



Z71261 



Caenorhabditis 
elegans cosmid 
F21C3, complete 
sequence. 



163 



M61831 



164 



U42608 



"0^4 | A39718 nicotinic acetylcholine 

receptor alpha chain - 
marbled electric ray 
(fragments) 



0.5 



|KDGE_DRU 
ME 



Human S- 
adenosylhomocys | 
teine hydrolase 
(AHCY) mRNA, 
complete cds. 



0.49 



|P2C2_ARA1 
H 



EYE-SPECIFIC 
DIACYLGLYCEROL 
KINASE (EC 
2.7.1.107) (RETINAL 
DEGENERATION A 
PROTEIN) 
(DIGLYCERIDE 
KINASE) 

(DGK)>GP:DRODAG 
K_l Fruit fly mRNA 
for diacylglycerol 
kinase, complete cds 



P 

VALUE 



0.84 



0.27 



4.60E-05 



Glycine max 
clathrin heavy 
chain mRNA, 
complete cds. 



0.48 



<NONE> 



PROTEIN 
PHOSPHATASE 2C 
(EC 3.1.3.16) 
(PP2C)>PIR2:S55457 
phosphoprotein 
phosphatase (EC 
3.1.3.16) 2C- 
Arabidopsis 
thaliana>GP:ATHPP2 
CA_1 Arabidopsis 
thaliana mRNA for 
1 protein phosphatase 2C 
<NONE> 
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Table 2 



PCT/US98/27610 



SEQ 
ID 



Nearest 
Neighbor 
(BlastN vs. 
Gen bank) 



ACCESSIONl DESCRIPTION 



165 



Z93042 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6B17;HTGS 
phase 1 . 



166 



AC000044 I Human" 

Chromosome 
22ql3 Cosmid 
Clone p76el0, 
complete 
sequence. 



167 



X51508 I Rabbit mRNAforl 
aminopeptidase N ) 
(partial). 



169 



H.sapiens DNA 
segment 
containing (CA) 
repeat; clone 
AFM323yfl; 
single read. 

Z93042 Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6B17; HTGS 
phase I. 



P 

VALUE 

0.47 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



ACCESSION 



PYRD^BACS 
U 



DESCRIPTION 



P 

VALUE 



"0^47 |jvlATK_MAR 
PO 



0.47 I S45361 



0^5 I JQ2246 



DIHYDROOROTATE 0.002 
DEHYDROGENASE 
(EC 1.3.3.1) 

(DIHYDROOROTATE 
OXIDASE) 
(DHODEHASE)>PIR I 
:H39845 

dihydroorotate oxidase 
(EC 1.3.3.1). Bacillus 
subtilis>GPN:BSUB00 
09_25 Bacillus subtilis 
complete genome 
(section 9 of 21): from 
1598421 to 1807200; 
PROBABLE INTRON 
MATURASE>PIR2:A 
05034 hypothetical 
protein 370i - liverwort 
(Marchantia 
polymorpha) 
chloroplast>GP:CHMP 
XX21 Liverwort 
Marchantia 

polymorpha chloroplast 
genome DNA; 

ORF370i 

LRR47 protein - fruit | 5.30E-07 
fly (Drosophila 
melanogaster)>GP:DM 
LRR47_1 

D;melanogaster mRNA 
for LRR47 



22.5K cathepsin D | 0/79" 
inhibitor protein 
precursor - 

potato>GP:POTCATH 
D_l Potato cathepsin D 
inhibitor protein 
mRNA, complete cds 
Schistosoma mansoni | 0.0022 
elastase gene, 3045 bp 
clone, complete cds 
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BNSDOCID: <WO 9933982A2_I. 



PCTAJS98/27610 



WO 99/33982 



[Nearest 
1 Neighbor 
|(BlastN vs. 
lGenbank) 



SEQ JACCESSION 
ID 



Table 2 



DESCRIPTION 



P 

VALUE 



I Nearest 
Neighbor 
(BlastX vs. 

I Non- 
Redundant 

| Proteins) 
ACCESSION 



DESCRIPTION 



170 



171 



172 



173 



174 



Li 1 172 [Plasmodium 

falciparum RNA 
polymerase I 
gene, complete 
jcds. 

Z95889 Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** f rom clone 
211A9; HTGS 

[phase 1. 

U32772 Haemophilus 

influenzae from 
bases 954819 to 
966363 (section 
87 of 163) of the 
complete 
genome. 



"299281 Caenorhabditis 
elegans cosmid 
Y57G11C, 
complete 

| sequence. 

X04571 Human mRNA 
for kidney 
epidermal growth 
factor (EGF) 
precursor. 



0.43 



0.43 



IHUMPKD1G0 
8 1 



A09811 1 



0.43 



|YPT2_CAEB 
L 



"042 I PTU19464_1 



0.42 



|YEK9_YEAS 
T 



| Homo sapiens 
I polycystic kidney 
disease (PKDl)gene, 
exons 43-46; Polycystic] 
kidney disease 1 

1 protein 

| R;norvegicus mRNA 
I for BRL-3 A binding 
| protein; Author-given 
J protein sequence is in 
conflict with the 
conceptual translation 

HYPOTHETICAL 21.6 1 
KD PROTEIN 
F37A4.2IN 
CHROMOSOME 
III>PIR2:S44639 
F37A4.2 protein - 
Caenorhabditis 
elegans>GP:CELF37A 
4_8 Caenorhabditis 
elegans cosmid F37 A4 
Paramecium tetraurelia 
outer arm dynein beta 
heavy chain gene, 
I complete cds 

HYPOTHETICAL 53.9 
KD PROTEIN IN 
AFG3-SEB2 
INTERGENIC 
REGION>PIR2:S5047 
7 hypothetical protein 
YER019w- yeast 
(Saccharomyces 
cerevisiae)>GP:SCE95 
37_20 Saccharomyces 
cerevisiae chromosome 
Vcosmids 9537, 9581, 
9495, 9867, and 
lambda clone 5898 



1 



2.50E-28 



0.99 



195 



WO 99/33982 



Table 2 



PCT/US98/27610 



I Nearest 
[Neighbor 
((BlastN vs. 
I Gen bank) 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



SEQlACCESSlON 
ID 



175 



U32772 



176 I AC002053 



177 



U88309 



M34025 



Haemophilus 
influenzae from 
bases 954819 to 
966363 (section 
87 of 163) of the 
complete 
genome. 



Human 
Chromosome 
9p22 Cosmid 
Clone 92f5, 
complete 
sequence. 

Caenorhabditis 
elegans cosmid 
T23B3. 



P 

VALUE 



ACCESSION 



0.41 



YPT2_CAEE 
L 



DESCRIPTION 



~0A I HSU33837 1 



HYPOTHETICAL 21.6 

KD PROTEIN 

F37A4.2 IN 

CHROMOSOME 

III>PIR2:S44639 
F37A4.2 protein - 
Caenorhabditis 
e!egans>GP:CELF37A 
4_8 Caenorhabditis 
elegans cosmid F37A 4 
Human glycoprotein 
receptor gp330 
precursor, mRNA, 
complete cds 



P 

VALUE 



7.80E-21 



0.4 



|dromttgn 

C 1 



179 I AC002395 



Human fetal Ig 
heavy chain 
variable region 
(clone M44) 
mRNA, partial 
cds. 



0.39 



|DNA2_YEAS 
T 



Drosophiia 
melanogaster 
mitochondrial 
cytochrome c oxidase 
subunit I (COI) gene, 5 f 
end, Trp-, Cys-, and 
Tyr-tRNA genes, 
NADH dehydrogenase 
subunit 2 (ND2) gene, 

3' end 

una REPLICATION 



Homo sapiens; 
HTGS phase 1, 
127 unordered 
pieces. 



039 I VV_MUMPE 



HELICASE 
DNA2>PIR2:S48904 
probable purine 
nucleotide-binding 
protein YHR3 64c- 
yeast (Saccharomyces 
cerevisiae)>GPN:YSC 
H9986_3 

Saccharomyces 
cerevisiae chromosome 
VIII cosmid 9986; 
Dna2p: DNA 
replication helicase; 
YHR1640GP: 

NONSTRUCTURAL 
PROTEIN V 

(NONSTRUCTURAL 
PROTEIN NS1) 



0.99 



0.11 
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WO 99/33982 



Table 2 



PCT/US98/27610 



Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



ACCESSION 



DESCRIPTION I P 

VALUE 



180 I AC003101 



181 



Z54335 



182 



U95743 



183 I ACOOIZZV 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 
ACCESSION I 



DESCRIPTION 



184 



X01703 



*** . 

SEQUENCING 
IN PROGRESS 
** Homo 
sapiens 

chromosome 17, 
clone 

HRPC41C23; 
HTGS phase 1, 
33 unordered 

pieces. 

Human DNA 
sequence from 
cosmidL17A9, 
Huntington's 
Disease Region, 
chromosome 
4pl6.3. Contains 
VNTR and a CpG | 
island. 

Homo sapiens 
chromosome 16 
BAC clone 
C1T987-SK65D3, 
complete 
sequence. 



Sequence of BAC 
F5I14from 
Arabidopsis 
thaliana 
chromosome 1, 
complete 

sequence. 

Human gene tor 
alpha-tubulin (b 
alpha 1). 



0.39 



|YLK2_CAEE 
L 



0.39 



IHUMNFAT3 
A 1 



1 HYPOTHETICAL 
122.7 KD PROTEIN 
D1044.2IN 
CHROMOSOME 
II1>GP:CELD1044_4 
Caenorhabditis elegans 
cosmidD1044 



Homo sapiens NF-AT3 
mRNA, complete cds 



"038 I CEZC434_6 



P 

VALUE 



"034 I HSOCAMJ 



Caenorhabditis elegans 
cosmidZC434, 
complete sequence; 
ZC434;6; CDNAEST 
CEESO02F comes 
from this gene; cDNA 
EST CEESS60F comes 

from this gene 

H;sapiens mRNA for 
immunoglobulin-like 
domain-containing 1 
protein 



0.33 



|NTC3_MOUS 
E 



NEUROGENIC 
LOCUS NOTCH 3 
PROTEIN>PIR2:S453 
06 notch 3 protein - 
I mouse>GP:MMNOTC 
_1 M;musculus mRNA 
I for Notch 3 



0.0001 



0.18 



0.051 



0.012 
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Nearest 
Neighbor 
(BlastN vs. 
Gcnbank) 



Table 2 



PCT/US98/27610 



ID 



185 



186 



ACCESSION 



Z82189 



Z98051 



DESCRIPTION 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 

170A21;HTGS 
phase 1 . 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
501A4; HTGS 
phase 1. 



187 I 298749 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
449017; HTGS 
phase 1. 



188 



X96763 



C.albicans CDC4 
gene. 



P 

VALUE 



J Nearest 
J Neighbor 
KBlastXvs. 
INon- 

J Redundant 
I Proteins) 



ACCESSION 



0.31 



LG106 3 



0.3 



S34960 



DESCRIPTION 



Lemna gibba 
negatively light- 
regulated mRNA 
(Lgl06); Second 
longest ORF (2) 



0.3 



|5CKC_LEIQ 
H 



0^9 I CECC4 1 



NADH dehydrogenase 
(ubiquinone) (EC 
1 .6.5.3) chain 5 - 
Crithidia oncopelti 
mitochondrion 
(SGC6)>GP:MICOCN 
NR_3 Crithidia 
oncopelti mitochondrial 
ND4, ND5, COI, 12S 
ribosomal RNA genes 
for NADH 

dehydrogenase subunit 
4/5, cytochrome 
oxidase sub un 

CHARYBDOTOXIN 
(CHTX)(CHTX- 
LQ1)>PIR2:A60963 
charybdotoxin 1 - 
scorpion (Leiurus 
quinquestriatus)>3D:2 
CRD Charybdotoxin 
j(nmr, 12 structures) - 
scorpion (Leiurus 
quinquestriatus) 



Caenorhabditis elegans 
cosmid CC4, complete 
sequence; CC4;a; 
Protein predicted using 
Genefinder; 
I preliminary prediction 



P 

VALUE 



0.27 



0.25 



0.12 



1.30E-17 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



Table 2 



SEQ 
ID 



(Nearest 
I Neighbor 
(BlastX vs. 
[Non- 

I Redundant 
1 Proteins) 



ACCESSION I DESCRlKi 1UIN 



ESSION | D 



189 



U38804 



Porphyra 
purpurea 
chloroplast 
genome, 
complete 
sequence. 



P 

VALUE 



ACCESSION I DESCRIPTION 



P 

VALUE 



0.28 



IHIVHCDR3C 
1 



I Human 

immunodeficiency 
virus type 1 heavy- 
chain complemetarity- 
determining region 3 
mRNA (clone 11), 
partial cds; Heavy- 
chain complementarity- 
determining region 3 
(CDR3)from HIV 
gP 120- 

>GP:HIVHCDR31J 
Human 

I immunodeficiency 
virus type 1 he 



1 



190 



U20657 



1 Human ubiquitin 
protease (Unph) 
proto-oncogene 
mRNA, complete 
cds. 



"028 I HSU20657J 



Human ubiquitin 
protease (Unph) proto- 
oncogene mRNA, 
complete cds 



5.60E-12 



191 I AC002037 



192 



U58748 



Human 

Chromosome 11 
Overlapping 
Cosmids 
cSRL72g7 and 
cSRL140b8, 
complete 
sequence. 
Caenorhabditis 
elegans cosmid 
ZK180. 



0.27 



IVRPl YEAS VERPROLlN>O P:SC Z.uuiwi 
1 "t VERPRLJ 1 

S;cerevisiae (A364) 
gene for verprolin 



0.27 



IexlFtoba 
c 



193 



Z68013 



Caenorhabditis 
elegans cosmid 
W02H3, 
complete 
sequence. 



0.26 



PISTIL-SEC1F1C' 
EXTENSIN-LIKE 
PROTEIN 
PRECURSOR 
(PELP)>PIR2:JQ1696 
pistil extensin-like 
protein precursor (clone I 
pMG15)- common 
tobacco>GP:NTPMGl 
5_1 N;tabacum mRNA 
for pistil extensin like 

I protein 

<NONE> 
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Table 2 



PCT/US98/27610 



SEQ 
ID 



194 



Nearest 
Neighbor 
(BJastN vs. 
Gen bank) 



ACCESSION 



AFO 17042 



195 



B03174 



196 I X70810 



197 



U80024 



DESCRIPTION 



Dictyostelium 
discoideum LTR- 
retrotransposon 
Skipper, partial 
genomic 
sequence, 5* end. 



P 

VALUE 



0.26 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



ACCESSION I DESCRIPTION" 



SPBC31F10J 
4 



cSRL-16e2-u 

I cSRL flow sorted 
Chromosome 1 1 
specific cosmid 
Homo sapiens 
genomic clone 
cSRL-16e 2. 

E.graciiis 

chloroplast 

complete 

genome. 



Caenorhabditis 
elegans cosmid 
C18B10. 



0.26 |CELC30E1_7 



S;pombe chromosome 
II cosmid c3IF10; 
Hypothetical protein; 
SPBC31F10;14c, 
unknown, len:1586aa, 
some similarity eg; to 
YJR140C, 
YJ9H_YEAST, 
P47171, involved in 
ceii cycle regulation 



Caenorhabditis elegans 
cosmid C30E1 



0.25 



0.25 



J CEK10H10j$ Caenorhabditis elegans " 
cosmid Kl OH 10, 
complete sequence; 
K10H10;k; Protein 
predicted using 
Genefinder; 

T preliminary pred iction ■ 

JMMAF001794 Mus musculus Treacherl 
Collins Syndrome 
protein (Tcofl)mRNA, 
complete cds; Putative 
nucleolar 

phosphoprotein; similar j 
to Homo sapiens 
Treacher Collins 
syndrome TCOF1 
protein 

encoded>GP:MMAF00 j 
1794_1 Mus musculus 
Treacher Collins 
Syndrome p 



P 

VALUE 



1 



0.38 



0.98 



0.017 



200 
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Table 2 



[Nearest 
I Neighbor 
r J(BlastN vs. 
iGenbank) 



ACCESSION! DESCKlFllurT 



198 I AC000591 



199 I AC000591 



200 



Z99571 



I Drosophila 
melanogaster 
(subclone 9 j>3 
from PI DS01486 
(D32))DNA 
sequence, 
complete 
sequence. 



P 

VALUE 



[Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 

I Proteins) 
ACCESSION 



DESCRIPTION 



P 

VALUE 



0.25 



IYHGEJECOL 
1 



HYPOTHETICAL 64.6 1 U.UUU05 
KD PROTEIN IN 
MRCA-PCKA 
INTERGENIC 
REGION 

(F574)>PIR2:E65135 
hypothetical 64.6 kD 
protein in mrcA-pckA 
intergenic region - 
Escherichia coli (strain 

K- . 
12)>GP:ECAE000415_| 

7 Escherichia coli , 

mrcA, yrfE, yrfF, yrfG, 

yrfH, yrfl 



Drosophila 
melanogaster 
(subclone 9 _g3 
from PI DS01486| 
(D32))DNA 
I sequence, 
complete 
I sequence. 



201 



U00672 



Human DN A 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
388N15;HTGS 
phase 1. 



(Human 
interleukin-10 
receptor mRNA, 

1 complete cds. 



"0 25 1 YHGE ECOL H Y POTHET1C AL 04.0 
1 " KD PROTEIN IN 

MRCA-PCKA 
INTERGENIC 
REGION 

(F574)>PIR2:E65135 
hypothetical 64.6 kD 
protein in mrcA-pckA 
intergenic region - 
Escherichia coli (strain 

K- . 
12)>GP:ECAE000415_ 

7 Escherichia coli , f 
mrcA, yrfE, yrfF, yrfG, 

yrfH, yrfl 

HYPOTHETICAL 24.2 1 

KD PROTEIN 
C13A11.03IN 
CHROMOSOME 
I>GP:SPAC13A11_3 
S;pombe chromosome 1 1 
cosmidc!3All; 
Unknown; 
SPAC13A11;03, 
unknown, len: 210 
- Polypeptides entry 
for factor Oct-2.5 



0.24 



|YA53_SCHP 
O 



0.017 



"024 I TFDP00900 



201 



WO 99/33982 



Table 2 



PCT/US98/27610 




204 I B18861 



205 I U00672 



206 I X52105 



208 I Z49631 



*** 

SEQUENCING 
IN PROGRESS 
*** Mouse 
Chromosome 6 
BAC clone 
b245c!2; HTGS 
phase 2, 8 
ordered piece s. 
Homo sapiens 
microsatellite 
sequence in the 
HNF3agene. 



h20C18-Sp6 IGF~ 
Arabidopsis 
thaliana genomic 
clone F2 0C 18. 
Human 
interleukin-10 
receptor mRNA, 
complete cds. 



0.22 



coiiagen aipua 
1 (XVIII) chain - mouse 
(fragment)>GP:MUSC 
OLLAG_l Mouse 
mRNA for collagen, 
partial cds 
'i*ura>659 | -Polypeptides entry' 
for factor PR 



0.00072 



0.0003 



u.^ J TFDPouyuo -Polypeptides entry | 1.00E-05" 
for factor Oct-2.5 



Dictyostelium 
discoideum SP60 
gene for spore 
coat protein. 

Saccharopolyspor 
a erythraea 
insertion 

sequence IS 1136, 
copy B, 3* end. 
S.cerevisiae 
chromosome X 
reading frame 
ORF YJR131w , 
h.rubripes GSS 
sequence, clone 
043C17aB8. 



0.18 I <NONE> 



0J7 I D88764 1 



<NONE> <NONE> 



Rana catesbeiana 
mRNA for alpha 2 type 
I collagen, complete 
cds 



|YSCDAL1A_ Saccharomyces" 

1 cerevisiae alantoinase 

(DAL 1 ) gene, complete I 
I cds 

|^LC27A12_ Caenorhabditis elegans 1.30E-07 
8 cosmidC27A12; 

Partial CDS; this gene 
begins in the 
neighboring clone; 
coded for by C; elegans j 
cDNAykl27fl;3; 
coded for by C; elegans I 
|cDNA yk 1 27fl ;5 
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Table 2 



PCT/US98/27610 



SEQ 
ID 



Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



ACCESSION! DESCRlKllON 



210 



U92852 



211 



X62620 



I Rhoiptelea 
chiliantha 
maturase (matK) 
gene, chloroplast 
gene encoding 
chloroplast 
protein, complete 
|cds. 

B.mori Abd-A 
gene homeobox. 



P 

VALUE 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 
ACCESSION 



~0T5 I SEU40259_5 



DESCRIPTION 



Staphyloccous 
epidermidis 

trimethoprim resistance 
plasmid pSK639;Orf53 



~2\2 | J02079 lepstein-barr virus 
simple repeat 
array (ir3). 



-q"Y5 — f ATAP22_36 Arabidopsis thaliana 
DN A chromosome 4, 
ESSAI AP2 contig 
fragment No; 2; 
Hypothetical protein; 
Similarity to NADH 
dehydrogenase, 
Chondrus crispus; 
1mNOS:S59107 

-qJ5 | A38346 



P 

VALUE 



0.95 



0.75 



ultra-high-sulfur 

keratin 1 - 
mouse>GP:MUSSERl 
1 Mouse serine 1 ultra 
high sulfur protein 
gene, complete cds; 
Putative 



7.50E-05 



213 



M35027 



[ Vaccinia virus, 
complete 
genome. 



0.14 



"214 I AC003058" 



*** 

SEQUENCING 
IN PROGRESS 
*** Arabidopsis 
thaliana 'IGF 
BAC T27F23* 
genomic 
sequence near 
marker 
CIC06E08'; 
HTGS phase 1,8 
unordered pieces. 



0.14 



IMTFl FUSN MODIFICATION 

U METHYLASE FNUDI 

(EC 2.1.1.73) 
(CYTOSINE- 
SPECIFIC 

METHYLTRANSFER 
ASE FNUDI) 
(M.FNUDI) 

IHEXA DICD1 BblA- 

HEXOSAMINIDASE 
ALPHA CHAIN 
PRECURSOR (EC 
3.2.1.52) (N-ACETYL- 
I BETA- 

GLUCOS AM1NIDASE I 

) (BETA-N- 
ACETYLHEXOSAMI 
NIDASE)>PIR2:A307 
66beta-N- 

acetylhexosaminidase 
(EC 3.2.1.52) A 
precursor - slime mold 
(Dictyostelium 



0.87 



0.006 
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Table 2 



PCT/US98/27610 



[Nearest 
I Neighbor 
((BlastN vs. 
iGenbank) 



SEQ IACCESSION 
ID 



DESCRIPTION 



Nearest 
Neighbor 
(BlastX vs. 
Non- 

Redundant 
Proteins) 
ACCESSION 



I AC001229 




DESCRIPTION 



Sequence of BAC 
F5I14from 
Arabidopsis 
thaliana 
chromosome 1, 
complete 
sequence. 



Capra hircus 
beta-mannosidase 
mRNA, complete 
cds. 



0.12 



S70663 



*vi I ACO0038O 



P 

VALUE 



discoideum)>GP:DDIN 
AGAJ D;d 



SEQUENCING 
IN PROGRESS 
*** Human 
Chromosome 3 
pacpDJ70ilI; 
HTGS phase 1,2 
unordered pieces. 



0.12 I ATFCA8 19 



pol protein - simian T- 
cell lymphotropic virus 
type 1, STLV-1 (isolate 
Bab34) 

(fragment)>GP:STVB 
ABPOLA J Simian T- 
cell leukemia virus 
PCR derived (pol) 
gene, partial sequence 
BAB34POL; Bases 
4779-4918 EMBL 
ATK numbering 
system; BAB34POL 



0.77 



lectin heavy chain, N- 
acetylgalactosamine- 
specific - Entamoeba 
histolytica 

(fragment)>GP:EHU33 
443 J Entamoeba 
listolytica GalNAc 
ectin heavy subunit 

(hgW) gene, partial cds; 

N-acetylgalactosamine 

adherence lectin heavy 

subunit 



0.8 



Arabidopsis thaliana 
DNA chromosome 4, 
ESSA I contig 
fragment No; 8; 
Unnamed protein 
product 



0.64 
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Table 2 



PCTAJS98/27610 



ID 



I Nearest 
Neighbor 
(BlastN vs. 

1 Gen bank) 



lACCESSIONI DESCR1KI1UIN 



P 

VALUE 



I Nearest 
I Neighbor 
I (Bias tX vs. 

1 Redundant 
1 Proteins) 
ACCESSION I 



DESCRIPTION 



P 

VALUE 



218 



X61207 



219 I AF0142b9 



220 I ACOOUWO 



(A.brasilense hisB,| 
H, A,FandE 
genes for 
imidazole 
glycerolphosphat 
e dehydratase, 
glutamine 
amidotransferase, 
phosphorybosilfo 
rmimino-5- 
amino- 

phosphorybosil- 
4- 

imidazolecarboxa 
mide isomerase, 
I cyclase and 
phosphorybosii- 
AMP- 

I cyclohydrolase. 
HIV-1 Patient 
1088 from 
Edinburgh, MA- 
pl 7 (gag) gene, 
I partial cds. 
Drosophila 
melanogaster 
(subclone 2_cll 
from PI DS07660| 
(D44))DNA 
sequence, 
complete 
sequence. 



~Ol2 I OCCL02_l 



"221 AC002428 



I Human BAC 
clone GS039E22 

I from 5q31, 
complete 

I sequence. 



222 



L40949 



Homo sapiens 
(clone AT7-5eu) 
opioid-receptor- 
like protein 
mRNA, 5* end. 



1 0;circumcincta colost- 
2 gene; Cuticular 
collagen 



0.0074 



"Oil I DMU88570_1 Drosophila 



0.11 



melanogaster CREB- 
binding protein 
homolog mRNA, 
complete cds; CBP 
hypothetical protein in 
dmsC 3' region - 
Escherichia coli (strain 

K- i 
12)>GP:ECAE000192_ 

1 Escherichia coli , 

ycaD, ycaK, pflA, pflB, 

fbcA genes from bases 

944908 to 955952 

(section 82 of 400) of 

the complete genome; 

Hypothetical protein in 

dmsC 



"On I HSNMYC2_1 



Human N-myc gene 
exon 2; Put; N-myc 
protein (aa 1-263) (953 
is 1st base in codon) 



"5n I CEUNC93J2 



0.051 



0.00014 



C;elegans unc-93 gene; 
Protein 2 



1.20E-13 
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PCT/US98/27610 



ID 



Nearest 
Neighbor 
(BlastN vs. 
Gen bank) 



ACCESSION 



223 



224 



AL008636 



D86993 



DESCRIPTION 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
722E9; HTGS 
phase 1 . 



225 TAC002539" 



Human (lambda) 
DNA for 
immunoglobulin 
light chain 



Homo sapiens 
chromosome 17, 
clone 195o20, 
complete 
sequence. 



226 I M881 65 



Human inter- 
alpha-trypsin 
inhibitor light 
chain (ITI) gene, 
exon 1 . 



227 



Z92851 



Caenorhabditis 
elegans DNA*** 
SEQUENCING 
IN PROGRESS 
*** from clone 
Y39G8; HTGS 
[phase 1. 



P 

VALUE 



o.i 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 
ACCESSION 



XELCOL2A1 
A 1 



CELM02B7 2 



DESCRIPTION 



Xenopus laevis alpha- 1 
collagen type IF 
mRNA, complete cds; 
Alpha-1 type IF 
collagen 



Caenorhabditis elegans 
cosmid M02B7 



P 

VALUE 



2.60E-06 



1.80E-09 



0.098 



ImtCY7D11_ 
17 



A54161 



0.082 



[CYA7JBOVT 
N 



Mycobacterium ~ | 0.026 
tuberculosis cosmid 
Y7D11; Unknown; 
MTCY07Dll;17c; 
unknown, len: 186aa, 
FASTA best: Ql 0390 
Y009_MYCTU 
hypothetical 31;0KD 
protein MTCYI90;09C 
(299 aa) opt: 355 z- 
score: 316;8 

ryanodine-binding 
protein alpha form - 
buIlfrog>GP:D2 1 070 J 
Rana catesbeiana 
mRNA for bullfrog 
skeletal muscle calcium | 
release channel 
(ryanodine receptor) 
alpha isoform(RyRl), 
complete cds; 
Ryanodine receptor 
alpha isoform 

ADENYLATE — 1 oT 

CYCLASE, TYPE VII 
(EC 4.6.1.1) (ATP 
PYROPHOSPHATE- 
LYASE) (ADENYLYL 
CYCLASE) 
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Table 2 



PCTAJS98/27610 



[Nearest 
Neighbor 
|(BlastNvs. 
lGenbank) 



p^earest 
I Neighbor 
J(BlastX vs. 
INon- 

I Redundant 
| Proteins) 



seQ |accession 

ID 



L00638 



Arabidopsis 
thaliana ubiquitin 
conjugating 
enzyme exons 2- 



DESCRIPTION 



P 

VALUE 



I ACCESSION 



0.072 



|NUCM_TRY 
BB 



NADH-UBIQUINONE 
OXIDOREDUCTASE 
49 KD SUBUNIT 
HOMOLOG (EC 
6.5.3) (NADH 
DEHYDROGENASE 
SUBUNIT 7 
HOMOLOG)>PIR2:A3 
5693 NADH 
dehydrogenase (EC 
1.6.99.3) chain 7- 
Trypanosoma brucei 
mitochondrion (SGC6) 



DESCRIPTION 



P 

VALUE 



0.24 



U49169 



Dictyostelium 
discoideum V- 
ATPase A 
subunit (vatA) 
mRNA, complete 
cds. 



"O071 IMMU65594J 



Mus musculus Brca2 
mRNA, complete cds; 
Similar to human breast 
cancer susceptibility 
gene BRCA2; Allele: 
wild type; putative 
tumor suppressor 



230 I AF001549 



Homo sapiens 
chromosome 16 
BAC clone 
CIT987SK- 
270G1 complete 
sequence. 



0.07 



|PM22_HUMA 
N 



PERIPHERAL 
MYELIN PROTEIN 22 
(PMP- 

22)>PIR2:JN0503 
peripheral myelin 
protein 22 - 
human>GP:HUMGAS 
3X_1 Human 
peripheral myelin 
protein 22 (GAS3) 
mRNA, complete 
cds>GP:HUMPMP22_ 
1 Human peripheral 
myelin protein 22 
mRNA, complete 
cds>GP:HUMPMP22 



L36829 



Mus musculus 
alphaA-crystallin 
binding protein I 
(AlphaA- 
CRYBPl)gene, 
complete cds. 



0.066 



<NONE> 



<NONE> 



232 I AC000159 



*** 



0.058 I CEZK863J 



SEQUENCING 
IN PROGRESS 
*** Human BAC 
Clone llq!3; 



Caenorhabditis elegans 
cosmid ZK863, 
complete sequence; 
ZK863;2; Similar to 
collagen 
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Table 2 



PC77US98/27610 




234 



*** 

SEQUENCING 
IN PROGRESS 
*** Human BAC 
Clone 1 lq 1 3; 
HTGS phase'l, 
10 unordered 
(pieces. 



CUTICLE 
COLLAGEN 2C 
(FRAGMENT)>GP:H 
AECOL2CJ 
Hjcontortus collagen 
2C mRNA, 3'end 



Z23908 



H. sapiens 
(D5S630)DNA 
segment 
containing (CA) 
repeat; clone 
AFM268zd9; 
single read. 



0.057 



235 | B21875 |T3E8-Sp6TAMU| 
Arabidopsis 
thaliana genomic 
clone T3E8. 



0.055 



VEU34999J Venezuelan equine 
encephalitis virus 
nonstructural and 
structural polyprotein 
genes, complete cds; 
Nonstructural 
polyprotein; Internal 
stop codon, 

readthrough occurs 5% 
of the time 



| YRR2CAEE 
L 



236 



Z98303 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 

I*** from clone 
I40H19; HTGS 

(phase 1. 



0.048 



HYPOTHETICAL 91.1 
KD PROTEIN R144.2 
IN CHROMOSOME 
III>GP:CELR144J7 
Caenorhabditis elegans 
cosmidR144; Coded 
for by C; elegans 
cDNA CEESP84R; 
coded for by C; elegans 
cDNA yk23c4;5; coded 
for by C; elegans 
cDNA yk44f9;5; coded 
for by C; eleg 



AC002330_3 Arabidopsis thaliana 
BAC T10P11, 
complete sequence; 
Putative zinc-finger 
protein; C2H2 Zn- 
finger signature from 
position 80 to 100 
[CEICNKGFQRDQNL 
1 QLHRRGH] 



0.0002 



0.68 



0.99 
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Table 2 



SEQ 
ID 



Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



ACCESSlONl description 



p 

VALUE 



I Nearest 

1 Neighbor 
(BlastXvs. 

I Non- 
Redundant 

1 Proteins) 
ACCESSION 



237 



D49911 



Thermus 
thermophilus 
UvrA gene, 
complete cds. 



0.044 



|APPl_MOUS 

E 



DESCRIPTION 



P 

VALUE 



238 I D49911 



Thermus 
thermophilus 
UvrA gene, 
complete cds. 



0.044 



|MMCOL18Al 
1 2 



AMYLOID-LIKE 
PROTEIN 1 
PRECURSOR 
(APLP)>PIR2:A46362 
amyloid precursor-like 
protein - 

mouse>GP:MUSAPLP 
_1 Mouse amyloid 
precursor-like protein 
mRNA, complete cds 
Mus musculus alpha- 
1 (XVIII) collagen 
(COL18Al)gene, 
exons 40-43, complete 
cds 



239 I X78119 



P.amygdalus, 
Batsch (Texas) 
jprul mRNA. 



0.042 



|CA44_HUMA 
N 



"240T U72877 



Rana catesbeiana 
L-epinephrine 
transporter 
mRNA, complete 
cds. 



0.041 



|YRR6_MYCL 
A 



241 



L39891 



242 



L40390 



Homo sapiens 
polycystic kidney 
disease- 
associated protein 
(PKDl)gene, 
1 co mplete cds. 
Candida glabrata 
ERG3gene, 
I complete cds. 



0.04 



|MUC2_HUM 
AN 



G01763 



COLLAGEN ALPHA | 2.UUt-uo 

4(IV) CHAIN 
PRECURSOR>PIRl:C 
GHU1B collagen alpha 
4(IV) chain precursor - 
human>GP:HSCOL4A 
4_1 H;sapiens mRNA 
for collagen type IV 
alpha 4 chain; Type IV 
collagen alpha 4 chain 
HYPOTHETICAL 33.U 
KD PROTEIN IN 
LICA 3 , REGION(ORF 
R6)>PIR2:S42125 
hypothetical protein 3 - 
Mycoplasma 
capricolum 
(SGC3)>GP:MYCRP 
MH_6 M; capricolum 
rpmH, rnpA and HcA 
gene;Orf R6 
MUCIN 2 
(INTESTINAL 
MUCIN 2) 
(FRAGMENTS) 



atrophin-1 
human>GP:HSU23851 
_1 Human atrophin-1 
mRNA, complete cds 
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SEQ 
ID 



ACCESSION 



243 



244 



B281I3 



AC000030 



DESCRIPTION 



T2L16TRB 
TAMU 
Arabidopsis 
thaliana genomic 
clone T2L16. 
uur/5, complete 
sequence. 



P 

VALUE 



0.038 



0.033 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 
ACCESSION 



CELZK1248_ 
14 



ATFCA8_40 



"MSCRIPTION' 



P 

VALUE 



245 I B 10738 



246 



MlKil5-Sp6 IGF 
Arabidopsis 
thaliana genomic 
[clone F13G15. 



0.0J2 I D87521 1 



Arabidopsis thaliana | 06T 

DNA chromosome 4, 

ESSA I contig 

fragment No; 8; 
GIycerol-3-phosphate 
permease homolog; 
Similarity to glycerol- 
3-phosphate permease • 
Haemophilus 
influenzae 

Mus musculus DNA- | 02T 
PKcs mRNA, complete 
cds 



247 I Z49888 



Caenorhabditis 
elegans cosmid 
F31F4. 

Caenorhabditis 
elegans cosmid 
F47A4, complete 
sequence. 



248 



249 I Z94I61 



Human DNA 
sequence from 
PAC 306D1 on 
chromosome X 
contains ESTs. 

Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
N102CI0; HTGS 



I 1W*U52064J Kaposi's sarcoma- 

associated herpes-like 
virus ORF73 homolog 
gene, complete cds; 
I Herpesvirus saimiri 
ORF73 

homolog>GP:KSU756 
98J78 Kaposi's 
sarcoma-associated 
I herpesvirus long unique 
region, 80 putative 
ORF's and kaposin 
gene, complete cds; OR I 



SYNTHETASE II 
(GRAMICIDINS 
BIOSYNTHESIS 

GRSB PROTEIN) (EC 
«_ 6.-.-.-) 

°" 025 ■ S16323 hypothetical protein - 
Arabidopsis 
thaliana>GP:ATHB 1 J 
A;thaliana homeobox 
gene Athb-1 mRNA; 
Open reading frame 



0.0079 



V/O 99/33982 



Table 2 



PCTAJS98/27610 



SEQ 
ID 



[Nearest 
[Neighbor 
|(BlastN vs. 
lGenbank) 



ACCESSION 



250 I AC002094 



251 



D79994 



252 



phase 1. 



DESCRIPTION 



Genomic 
sequence from 
Human 17, 
complete 
sequence. 



Human mRNA 
for KIAA0172 
gene, partial cds. 



P 

VALUE 



(Nearest 
Neighbor 
(BlastXvs. 
Non- 
Redundant 
[ Proteins) 
ACCESSION I 



DESCRIPTION 



0.021 



S57447 



P 

VALUE 



0.021 |CER10H10_l 



254 I AC002543 



Human DN A 
sequence *** 
SEQUENCING 
IN PROGRESS 
J*** from clone 
438L4;HTGS 

(phase 1. 

X.laevis mRNA 
for selenocysteine I 
tRNA acting 
factor (Staf). 
Human BAC 
clone RG300C03 
from 7q31.2, 
complete 
sequence. 



0.017 



HPBRH-7 protein 
human>GP:HSHPBRH 
4_1 H;sapiensHPBRII- 
4 

mRNA>GP:HSHPBRII 
7 J H;sapiens HPBRII- 

7 gene 

Caenorhabditis elegans 
cosmid R10H10, 
complete sequence; 
R11A8;7; Protein 
predicted using 
Genefinder; Similarity 
to Mouse ankyrin (PiR 
Acc; No; S37771); 
cDNA EST CEESX25F 
comes from this gene; 
lcELW05H7__4 1 Caenorhabditis elegans 
cosmid W05H7 



8.20E-08 



0.017 



0.013 



JN0786 



[mzlmtcyt 

BT 1 



0.24 



integrin beta-4 chain 
precursor - mouse 

Mendozellus isis 
mitochondrial N ADH 
dehydrogenase, and 
cytochrome b genes, 3 1 
end, and transfer RN A- 
Ser gene; This codes 
for the last 43 amino 
acids of N ADH 
dehydrogenase subunit 
1 followed 



0.088 



0.044 
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Nearest 
Neighbor 
(BlastN vs, 
Genbank) 




256 



257 ] U62317 



258 



D29655 



^9 I AF002992 



260 



B20752 



Caenorhabditis 
elegans cosmid 
T20BI2. 



0.012 



[MMMHC29N 
7 2 



Saccharomyces 
cerevisiae protein 
phosphatase 
(PTCl)gene, 
complete cds . 
Chromosome 
22ql3BAC 
Clone 
CIT987SK- 
384D8 complete 
sequence. 



0.01] 



0.0093 



|P2yOenT 

A 



Pig mRNAfor 
UMP-CMP 
kinase, complete 
cds. 

Homo sapiens 
cosmid from 
Xq28, complete 
sequence. 



U.0075 I AF004858 1 



Mus musculus major 
histocompatibility 
locus class III 

region:butyrophiIin-Iike 
protein gene, partial 
cds; Notch4, PBX2, 
RAGE, lysophatidic 
acid acyl transferase- 
alpha, palmitoyN 
Human (gene i) DNA 
for phosphatase 2C 
motif, partial cds 



F2Y PURINOCEPTOR 
8 

(P2Y8)>GP:XLP2Y8_ 
1 X;laevis mRNA for 
P2Y8 nucleotide 
[ receptor 



0.069 



[fbnTboW" 

N 



T19M2-T7 
TAMU 
Arabidopsis 
thaliana genomic 
clone T19M2. 



0.0043 



I Mus musculus platelet 
activating factor 
receptor mRNA, partial 
cds; PAF-receptor 
FIBRILLIN 1 

PRECURSOR>PIR2:A 
55567 fibrillin I - 

bovine>GP:BOVXAA 
AA_J Bos taurus 
mRNA, complete cds; 
Putative 



HSVT1IEP_I Feline herpesvirus type 
1 gene for immediate 
early protein, complete 
cds; Feline herpesvirus 
type 1 immediate early 
protein 



0.89 



0.0004 



3.90E-05 
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Table 2 



[Nearest 
[Neighbor 
(BlastN vs. 
iGenbank) 



I ACCESSION I DESCRirnuiN 



261 I AB006699 



I Arabidopsis 
thaliana genomic 
DNA, 

chromosome 5, 
PI clone: MDJ22. 



P 

VALUE 



[Nearest 
I Neighbor 
|(BlastXvs. 

I Redundant 
I Proteins) 
1 ACCESSION 



0.0037 



|YHV5_YEAS 
T 



262 I Z99128 



263 I B21848 



264 I L33853 



265 I B36863 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
422H11;HTGS 
phase 1 
T2D2-Sp6 
TAMU 
Arabidopsis 
thaliana genomic 
clone T2D2 
Human germline 
immunoglobulin 
kappa chain 
variable region 
(Vk-IV subgroup) 
foranti-B- 
amyloid 
autoantibodies in 
Alzheimer ! s 
disease. 
HS-1042-A1 
F01-MR.abi CIT 
Human Genomic 
Sperm Library C 
Homo sapiens 
genomic clone 
Plate=CT 824 
Col=l Row=K. 



0.0032 



|ALUl_HUM 
AN 



B31794 



B45550 



DESCRIPTION 



HYPOTHETICAL 
143.6 KD PROTEIN 
IN SPO16-REC104 
INTERGENIC 
REGION>PIR2:S4675 
4 hypothetical protein 
YHR155w- yeast 
(Saccharomyces 
cerevisiae)>GPN:YSC 
H9666_15 
Saccharomyces 
cerevisiae chromosome 
VIII cosmid 9666; 
Yhrl55wp; Similar to 

Sip3p (Snf 

!!!! ALU 



0.077 



0.0087 



SUBFAMILY J 
WARNING ENTRY 



IfH 



mdm-1 protein (clone 
c!03)- mouse 



cytochrome b homolog 
- Plasmodium yoelii 



0.0027 



|YQK4_CAEE 
L 



1.00E-05 



0.99 



HYPOTHETICAL 64.3 
KD PROTEIN 
C56G2.4 IN 
CHROMOSOME 
II1>GP:CELC56G2_2 
Caenorhabditis elegans 
cosmid C56G2 



0.81 
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: :f.:-:V-:^? : | 


Nearest 
Neighbor 
(BlastN vs. 
Genbank) 




Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


INSCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


266 


AC003041 


*** 

SEQUENCING 
IN PROGRESS 
*** Homo 
sapiens 

chromosome 17, 
clone 

HCIT307A16; 
HTGS phase 1, 

1 0 nnnrrlAi¥»rl 


0.0024 


GLB4 LAMS 
P 


GIANT 

HEMOGLOBIN AIV 
CHAIN 

(FRAGMENT)>PIR2: 
SOI 810 hemoglobin 
AIV - tube worm 
(Lamellibrachia sp.) 
(fragment) 


0.94 



267 I AC002315 



268 I AF0 16674 



2W I AF0 16674 



270 I AF016674 



271 



254199 



pieces. 



Mouse BAC- 
146N21 

Chromosome X 

contains 

iduronate-2- 

sulfatase gene; 

complete 

sequence. 



Caenorhabditis 
elegans cosmid 
C03H5. 
Caenorhabditis 
elegans cosmid 
C03H5. 



0.0015 



|SCYJL204C_ 
1 



0.0015 I CEM199 3 



Caenorhabditis 
elegans cosmid 
C03H5. 



esculentum 
DNA Ailsa craig 
encoding 1- 
aminocyclopropa 
ne-l-carboxylic 
acid oxidase. 



0.0022 | MG42JTARM I SR Y-RELATED | OW 

A I PROTEIN MG42 

(FRAGMENT)>PIR3:I 
51369 Sry-related 
sequence - Tarentola 
mauritanica 
(fragment)>GP:TELM 
G42DNA_1 Gecko 
MG42 gene, partial 
cds; Sry-related 
sequence 
S;cerevisiae 
chromosome X reading 
frame ORF YJL204c 

Caenorhabditis elegans | 097" 
cosmid Ml 99, 
complete sequence; 
Ml 99;e; Protein 
predicted using 
Genefmder; 
preliminary prediction 
Caenorhabditis elegans | 0.97 
cosmid Ml 99, 
complete sequence; 
M199;e; Protein 
predicted using 
Genefmder; 

preliminary prediction 

0.00 id | CfcLF20Al_5 | Caenorhabditis elegans | — oTT 
cosmid F20A1; Coded 
for by C; elegans 
cDNA yk9gl;3; coded 
for by C; elegans 
cDNA yk9gl;5; coded 
for by C; elegans 
cDNA CEE SU55F; 
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I Nearest 
I Neighbor 
|(BlastN vs. 
lGenbank) 



ACCESSION 



Z99943 



273 



S81083 



DESCRIPTION 




I Nearest 
Neighbor 
(BlastX vs. 
Non- 

1 Redundant 
[Proteins) 
ACCESSION 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
313L4; HTGS 
phase 1. 



DESCRIPTION 



weak similarity to 
putative 



CEK08F8_5 Caenorhabditis elegans 
cosmid K08F8, 
complete sequence; 
K08F8;5b 



I beta - 

ADD=adducin 
beta subunit 63 
kda 

isoform/membran 
e skeleton 
protein, beta- 
ADD=adducin 
beta subunit 63 
kda 

isoform/membran 
e skeleton protein 
{alternatively 
spliced, exon 10 
to 13 region} 
[human, 
Genomic, 1851 
nt, segment 3 of 
3]. 



0.0013 I MTCY277J7 



Mycobacterium 
tuberculosis cosmid 
Y277; Unknown; 
MTCY277;07c, 
unknown, len: 302 



P 

VALUE 



0.93 



0.0001 



274 | Z82174 I Human DNA 
sequence from 
cosmid B20F6on 
chromosome 

22qll.2-qter. 

275 | Z82215 Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
6802;HTGS 
phase 1 . 



0.001 



0.00079 



|FBLA_HUM 
AN 



|bfri_schp 

O 



FIBULIN-1, 

ISOFORMA 

PRECURSOR>GP:HS 

FIBUA_1 H;sapiens 

mRNAforfibulin-1 A 

BREFELDIN A 

RESISTANCE 

PROTEIN>PIR2:S522 
39 hba2 protein - 
fission yeast 
(Schizosaccharomyces 
pombe)>GP:SPHBA2 
GEN_1 S;pombehba2 
gene 



0.15 
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Table 2 



Nearest 
Neighbor 
(BlastN vs. 
Genbank) 




ACon?^n 



279 



U91322 



280 



D 16986 



DESCRIPTION 



Caenorhabditis 
elegans UNC-76 
(unc-76) gene, 
complete cds. 



Human DNA 
sequence from 
clone J362G17K 



P 

VALUE 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



ACCESSION 



0.00071 



0.00054 



clone RG341D10 
from 7pl5-p21, 
complete 
sequenc e. 
Human 
chromosome 
16pI3 BAC clone I 
CIT987SK-276F8| 
complete 
sequence. 



CX2 HEMHA 



DMU34925 2 



<~eLi 2Sh2 



DESCRIPTION 



C YTOTOXIN 2 
(TOXIN 12 A) 



Drosophila 
melanogaster DNA 
repair protein (mei-41) 
gene, complete cds, and 
TH1 gene, partial cds 



U.0U051 I CEW08D2 



Human HepG2 
partial cDNA, 
clone 

hmd2b09m5. 



[POLGJPPVN 
A 



281 



U91318 



Human 
chromosome 
16pl3 BAC clone | 
CIT987SK- 
962B4 complete 



0.00031 I <NONE> 



Caenorhabditis elegans 
cosmid T28F2; Weak 
similarity to HSP90 



P 

VALUE 



0.32 



0.045 



0.26 



0.48 



Caenorhabditis elegans 
cosmid W08D2, 
complete sequence; 
W08D2;3; Protein 
predicted using 
Genefinder>GP:CEW0 
8D2_2 Caenorhabditis 
elegans cosmid 
W08D2; W08D2;3; 
Protein predicted using 
Genefinder 
GENOME 
POLYPROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER 
COMPONENT 
PROTEINASE (EC 
3.4.22.-) (HC-PRO); 
42-50 KD PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 KD 
PROTEIN; NUCLEAR 
INCLUSION 
PROTEIN A (NI- A) 
(EC 3.4.22.-) (49K 
PROTEINASE) (49 
<NONE> I <NONE> I 



0.037 
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"283 I AC002398 



Human dispersed 
Alu repeats and 
dispersed LI 
repeat. 

Human DNA 
from 

chromosome 19- 
specific cosmid 
F25965, genomic 
sequence, 
complete 
sequence. 



0.0003 I VU8_SPV4 



|hmca_dru 

ME 



"284 I AC002530' 



Human BAC 
clone RG341D10 
from 7pl5-p21, 
complete 
I sequence. 



0.0002 I PL0009 



I GENE 8 

PROTEIN>PIRl:G8BP| 
SV gene 8 protein - 
spiroplasma virus 4 
(SGC3) 
HOMEOT1C 
CAUDAL 

PROTEIN>PIR2:A263 
57 homeotic protein 
Cad -fruit fly 
(Drosophila 
melanogaster)>GP:DR 
OCADA2J 
D;melanogaster caudal 
gene (cad) encoding a 
maternal and zygotic 
transcript, exon 2; 
Caudal 

protein>TFD:TFDP001 
59 - Polypeptides en 
complement 
C3d/Epstein-Barr virus 
receptor precursor - 
human 



0.021 



0.7 



285 I X01871 



| Yeast 

mitochondrial 
ori(o) repeat unit 
of petite mutantS 
(petite strain s- 
10/7/2). 



0.00015 



|rvzmtcyt 

BT 1 



Reventazonia sp; 
mitochondrial NADH 
I dehydrogenase, and 
cytochrome b genes, 3' 
end, and transfer RN A- 
Ser gene; This codes 
for the last 43 amino 
acids of NADH 
dehydrogenase subunit 
1 followed 



0.73 



286 



U89984 



(Acanthamoeba 
castellanii 
transformation- 
sensitive protein 
homolog mRNA, 
complete cds. 



0.00015 |ACU89984_ 



Tl Acanthamoeba 
castellanii 
transformation- 
sensitive protein 
homolog mRNA, 
complete cds; Similar 
to human 
transformation- 



4.20E-13 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



Table 2 



PCT/US98/27610 



SEQ 
ID 



287 



ACCESSION 



AC002365 



DESCRIPTION 



^83 I AC002390 



W I AC002980 



290 I M99412 



Homo sapiens 
chromosome X 
clone U177G4, 
U152H5, 
U168D5, 174A6, 
U172D6, and 
U186B3from 
Xp22, complete 
sequence. 
Human DNA 
from overlapping 
chromosome 19- 
specific cosmids 
R30072 and 
R28588, genomic 
sequence, 
complete 
sequence. 
Homo sapiens; 
HTGS phase I, 
34 unordered 
(pieces. 



P 

VALUE 



0.00011 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



ACCESSION 



DESCRIPTION 



sensitive protein: 
SwissProt Accession 
Number P3 1948 



VALUE 



LUE I 



SI 0340 



&\ I AC000120 



[Human 
interleukin-8 
receptor (IL8RB) 
gene, complete 
cds. 



9.90E-05I D86603 1 



9.20E-05 ITRBKPCYB 
1 



4.50E-05 I S28832 



UNA-directed RNA 
polymerase (EC 
2.7.7.6) - yeast 
(Kluyveromyces 
marxianus var. lactis) 



Mouse mRNA for Bach 
protein I, complete cds; 
Bachl 



0.00062 



Trypanosoma brucei 
kinetoplast 

apocytochrome b gene, 
complete cds 



0.52 



Human BAC 
clone RG161K23 
from 7q21, 
complete 
I sequence. 



4.O0E-05 I SXSCRBA 1 



microtubule-associated 
protein HI (clone 
KS3.1) - longfin squid 
(fragment) 



0.88 



Sjxylosus scrB and 
scrR genes; Sucrose 
repressor 



0.99 
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Table 2 



PCT/US98/27610 



I Nearest 
(Neighbor 
|(BlastN vs. 
I Gen bank) 



I Nearest 
[Neighbor 
|(BlastX vs. 

1 Redundant 
I Proteins) 



~SEQ~ 
ID 



I ACCESSION I DESCR1F1 1UIN 



P 

VALUE 



ACCESSION I DESCRIPTION 



P 

VALUE 



292 I AC0O3037 



Homo sapiens; 
HTGS phase 1, 
66 unordered 
pieces. 



3.40E-05 



SI 3569 



293 



Z81512 



Caenorhabditis 
elegans cosmid 
F25C8, complete 
sequence. 



2.40E-05 



|musdbprc_ 
1 



hypothetical protein 5 - 
Lactococcus lactis 
subsp. lactis insertion 
sequence 

1076>GP:LLTLE_1 
Lactococcus lactis 
DN A for the 
transposon-like element I 
on the lactose plasmid; 
ORF5(AAl-43) 
Mus musculus DNA- 
binding protein Rc 
mRNA, complete cds; 
DN A binding protein 
Rc 



0.018 



294 



B16681 



343C3.TVB 
CIT978SKA1 
Homo sapiens 
genomic clone A- 
343C03. 



1.10E-05 



[copFyeas" 

T 



295 



Z16523 



296 I Z49704 



297 I AC003071 



H. sapiens 
(D9S158)DNA 
segment 
containing (CA) 
repeat; clone 
AFM073ybll; 
single read. 
S.cerevisiae 
I chromosome XIII 
cosmid 8021. 

Human BAC 
clone BK085E05 
from 22ql2.1- 
qter, complete 
1 sequence. 



1.00E-05 I MMSEMFJ 



[COATOMERBETA 
SUBUNIT (BETA*- 
|cOAT PROTEIN) 
I (BETA- 

|COP)>PIR2:B55123 
coatomer complex beta' 

I chain - yeast 
(Saccharomyces 

I cerevisiae)>GPN:SCY 
GL137WJ 
S;cerevisiae 
chromosome VII 
reading frame ORF 
i YGL137w>GP:SCUll 
237_1 Saccharomyces 

| cerevisiae 

M;musculus mRNA tor | 
|semaphorin F; 
Smaphorin F 



0.081 



0.78 



5.60E-06 1 <NONE> <NUiNt^ 



I 3.00E-06 I HSRCAER_1 



H;sapiens mRNA tor 
red cell anion 
exchanger (EPB3,AE1, 
Band 3) 3' non-coding 
I region 



[<NONE> I 



0.21 
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SEQ 
ID 



298 



299 



Nearest 
Neighbor 
(BlastN vs. 
Gen bank) 



U20428 
U51903 



DESCRIPTION 



Human SNC19 
mRNA sequ ence. 
Human RasGAP- 
related protein 
(IQGAP2) 
mRNA, complete 
cds. 



P 

VALUE 



1.40E-06 
6.60E-07 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 
ACCESSION 



HUMMUC2A 
_1 

IQGA^HUMA 
N 



DESCRIPTION 



JUU I AL000805 



h.rubripes GSS 
sequence, clone 
021G08aAl. 



1 4.70E-07 



|MT13_MYTE 
D 



Human mucin-2 gene, 
partial cds 

RAS GTPASE 
ACTIVA TING-LIKE 
PROTEIN IQGAP1 
(P195)>PIR2:A54854 
Ras GTPase activating- 
related protein - 
J human>GP:HUMIQG 
A_l Homo sapiens ras 
GTPase-activating-like 
protein (IQGAP1) 
mRNA, complete cds; 
Amino acid feature: IQ 
calmodulin-binding do 
MfcTALLOTHIONEIN 



P 

VALUE 



4.40E-06 



301TAC003016" 



Human BAC 
clone RG134C19 
from8q21, 
complete 
sequence. 



1 4.30E-07 SPC57AI0 5 



10-111 (MT-10 
III)>PIR2:S39418 
metallothionein I0-III 
blue mussel 



Ml I AC003089 



I AC002074 



I Human BAC 
I clone 

RGI80F08A, 
complete 
I sequence. 
Human BAC 
clone GS056H18 
I from 7q31-q32, 
I complete 
sequence. 



|hpbpreck 

I 



2.40E-07I A47021 1 



220 



S;pombe chromosome I 
cosmid c57AI0; 
Unknown; 
SPAC57A10;05;c, 
unknown, len:606aa, 
similar toA;nidulans 
Q00659, sulfur 
metabolite repression 
control, (678aa), fasta 
scores, opt: 1355, 
Hepatitis B virus type 
1 1 precore protein (pre- 
C region, C) gene, 5 1 
end 



Sequence 23 from 
Patent W09527787; 
Unnamed protein 
product; Author-given 
protein sequence is in 
conflict with the 
conceptual 

translation>GP: A5 1 260 
I 1 Sequence 23 from 

Patent W09614416; 
I Unnamed protein 
I product; Author-given 



2.20E-10 



0.00041 
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Table 2 



PCT/US98/27610 



[Nearest 
I Neighbor 
|(BlastN vs. 
iGenbank) 



[ACCESSION 



U04980 



305 



U68704 



306 



U51583 



307 I M80206 



DESCRIPTION 



P 

VALUE 



[Nearest 
Neighbor 

|(BlastX vs. 
Non- 
Redundant 

I Proteins) 
I ACCESSION 1 



DESCRIPTION 



P 

VALUE 



j protein sequence is l 



Rattus norvegicus 
fetal troponin T 3 
(fetal TnT3) 
mRNA, partial 
cds. 

[Human 
chromosome 
21q22.3 PI -clone 
3804 subclone 4- 
52. 



Rattus norvegicus 
zinc finger 
homeodomain 
enhancer-binding 
protein- 1 (Zfhep- 
1) mRNA, partial 
cds. 



"2.20E-07 |HUMFSHD_1 | Human 

facioscapulohumeral 

muscular dystrophy 
(FSHD) gene region, 
D4Z4 tandem repeat 

unit; ORF 

|HHV6AGNM I Human herpesvirus-C) 
96 |(HHV-6)U1102, 

variant A, complete 
virion genome; U88; 
Cys repeats; this loci is 
open in all six reading 
frames, part oflE-A 
8.70E-08 I AF005370_67 I Alcelaphine 

herpesvirus 1 L-DNA, 
complete sequence; 
Putative immediate 
early protein; ORF73; 
similar to H; saimiri 
and KSHV ORF73 



2.70E-05 



Mus domesticus 
poliovirus 
receptor homolog 
(MPH) mRNA, 
1 complete cds. 



308 I M60854 



309 



U82828 



Human ribosomal 
protein S16 
mRNA, complete 
cds. 



I Homo sapiens 
ataxia 

telangiectasia 
(ATM) gene, 
I complete cds. 



8.10E-08 I 153960 PRR2 alpha -human 



5.70E-08I OLVPOLJ 



1.50E-08 I C40201 



Caprine arthritis 
encephalitis virus 
(isolate OVLV-Nl)pol 
protein gene, 3' end of 
cds; Nt 2497-2695 
from CAEVCo 
artifact-warning 
sequence (translated 
ALU class C) - human 



6.10E-07 



1.70E-28 



0.27 



221 



WO 99/33982 



Table 2 



PCTAJS98/27610 



SEQ 


Nearest 
Neighbor 
(BlastN vs. 
Genbank) 




Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


ID 


ACCESSION 


UIL5CKIF1ION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


310 


Z83836 


Human DNA 
sequence from 
PAC lllJ24on 
chromosome 
22ql2-qter 
contains ESTs. 


1.40E-08 


HSU64473J 


Human rheumatoid 
arthritis synovium 
immunoglobulin heavy 
chain variable region 
mRNA, partial 
cds>GP;HSU64498_l 
Human rheumatoid 
arthritis synovium 
immunoglobulin heavy 


0.34 



311 



250029 



312 I AC002353 



Caenorhabditis 
elegans cosmid 
ZC504, complete 
sequence. 



Homo sapiens; 
HTGS phase 3, 
17 unordered 
pieces. 



313 



B65763 



314 



Z93021 



.4UE-08 IMMU88984 1 



■ — * ^ 

mRNA, partial cds 



fMus musculus NIK 
I mRNA, complete cds 



L70E-50 



1.20E-08I D41132 



CIT-HSP- 
2023A12.TR 
CIT-HSP Homo 
sapiens genomic 
clone 2023 A12. 
Human 



315 



D88035 



DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
516C23;HTGS 
phase 1, 
Rat mRNA for 
glycoprotein 
specific UDP- 
glucuronyltransfe 
rase, complete 
cds. 



3.60E-09 1 S18106 



collagen-related protein O02" 
4 -Hydra 
magnipapillata 
(fragment)>PIR2:S2 1 9 
32 mini-collagen - 
Hydra 

sp>GP:HSNCOL4J 
HydraN-COL4 

I mRNA for mini- 
collagen; No start 

[ codon 

type II site-specific | 0.045 
deoxyribonucleic (EC 
3.1.21.4) Abrl- 
Azospirillum brasilense I 



^.uuti-uy IAB00 1684 J 3 ChlorelJa vulgaris C-27 



I.DOb-091 D88035 1 



chloroplast DNA, 
complete sequence; 
RNA polymerase 
gamma subunit 



Rat mRNA for 
glycoprotein specific 
UDP- 

glucuronyltransferase, 
complete cds 



0.6 



1.00E-33 
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(Nearest 
(Neighbor 
l(BlastN vs. 
1 Gen bank) 



Table 2 



[Nearest 
I Neighbor 
|(BlastX vs. 
iNon- 

I Redundant 
I Proteins) 



SEQ | ACCESSION 
ID 



"DESCRIPTION 



316 



U85193 



I Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds. 



P 

VALUE 



| ACCESSION 



1.30E-10 I VGF1JBVB 



317 I B04719 



3lT| M73506 
319™! U71148 



cSRL-42G12-u 
cSRL flow sorted 
Chromosome 1 1 
specific cosmid 
Homo sapiens 
genomic clone 
CSRL-42G12. 
Mouse Tcp-lOc (t 
I allele) gene. 
Human Xq28 
cosmids U225B5 
and U236A12, 
complete 
I sequence 



DESCRIPTION 

Fl 

PROTEIN>PIRl:VFlH 
Bl Fl protein - avian 
infectious bronchitis 
virus (strain 
Beaudette)>GP:IBACG 
B_l Avian infectious 
bronchitis virus pol 
protein, spike protein, 
small virion-associated 
protein, membrane 
protein, and 
nucleocapsid protein 
gen 



7.90E-11 



JC5238 



P 

VALUE 

T 



galactosylceramide-like 
protein, GCP - human 



0.31 



2.80E-11 
1.20E-11 



320 | Z95TT6 



[Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
57G9;HTGS 
[phase 1. 



321 I M64795 



I Rat MHC class I 
antigen gene 
(RTl-u 
haplotype), 

I complete cds. 



A39487 
A56547 



T-complex protein 10a I 4.10E-16 
(allele 129) - mouse 

sex-peptide precursor - I 0.4 
Drosophila suzukii 



9.90E-13 



|ALU2_HUM 
AN 



1.70E-14 |STC_DKUME 



!!! ALU 

SUBFAMILY SB 
WARNING ENTRY 



0.0017 



MM 



SHUTTLE CKAh I 
PROTEIN>GP:DMU0 
9306_1 Drosophila 
melanogaster shuttle 
craft protein (stc) 
mRNA, complete cds; 
C-terminal 222 amino 
acids encode a novel 
single- stranded DNA 
binding domain 
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PCT/US98/27610 



SEQ 
ID 



Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



ACCESSIONl DESCRIPTION 



322T Y09036 



323 I U 12523 



324 



Z98755 



IH.sapiens 
NTRK1 gene, 
exon 17. 



I Rattus norvegicus 
ultraviolet B 
radiation- 
activated UV98 

I ITlRWA nartial 

j -J f""*» 

I sequence. 
Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
76C18; HTGS 
phase 1. 



P 

VALUE 



4.20E-15 



2.90E-15 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



ACCESSION] DESCRIPTION 



AFO 10403 1 



SPBC30D10_ 
4 



225 \ M86917 



Human oxysterol- 
binding protein 
(OSBP) mRNA, 
complete cds. 



szo I AC001231 



327 JAL008626" 



Genomic 
sequence from 
Human 17, 
complete 
I sequence. 



|rpon_hal 

MA 



J.60E-15I CEF14H8_2" 



1.3UE-15 I AC002397_3 



Homo sapiens ALR 
mRNA, complete cds; 
Alternatively spliced; 
similarity to ALL- 1 and 
Drosophila trithorax 
S;pombe chromosome 
II cosmid c30D10; 
Hypothetical protein; 
SPBC30D10;04, 
unknown, len:348aa 



DNA-DIRECTED 
RNA POLYMERASE 
SUBUNITN(EC 
2.7.7.6)>PIR2:D43715 
DNA-directed RNA 
polymerase II chain 
RPBlOhomolog- 
Haloarcula 

marismortui>GP:HAL 
HMAENOA_4 
H;marismortui tRNA- 
Leu, HL29, HmaL13, 
HmaS9, OrfMMV, 
OrfMNA, 2- 

phosphoglycerate 
dehydr 



1 



2.40E-09 



0.019 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
1I14G22; HTGS 
| phase 1. 



S.30E-16 I TAU48227J 



Caenorhabditis elegans | 2.10E-18 
cosmid F14H8, 
complete sequence; 
F14H8;1; Similarity to 
Human oxysterol- 
binding protein 
(SW:OXYB HUMAN)! 



Mouse BAC284H12 
Chromosome 6, 
complete sequence; 
DRPLA 



0.0016 



Triticum aestivum 
soluble starch synthase 
mRNA, partial cds 



5.90E-05 
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PCT/US98/27610 



I Nearest 
I Neighbor 
|(BlastN vs. 
IGenbank) 



■ Nearest 
I Neighbor 
|(BlastX vs. 
iNon- 

I Redundant 
I Proteins) 



SEQ 
ID 



ACCESSIONl DESCRIPTION 



L04483 



Human ribosomal 
protein S21 
(RPS21)mRNA, 
complete cds. 



P 

VALUE 



ACCESSION 



7.60E-17 



|RS21_HUMA 
N 



329 I AB001S99 



I Homo sapiens 
PACE4gene, 
exon 2. 



6.70E-17 



|LRP1_HUMA 
N 



DESCRIPTION 



40S RIBOSOMAL 
PROTEIN 
S21>PIR2:S34108 
ribosomal protein S21 - 
human>GP:SSZ84015_ 
1 S;scrofa mRNA; 
expressed sequence tag 
(3'; clone cllglO); 40S 
ribosomal protein S21; 
Similar to human 40S 
ribosomal protein 
S21>GP:HUMRPS21X 
1 Human ribosomal 



P 

VALUE 

1.40E-09 



330 



Z98755 



331 I AF017187 



Human DN A 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
76C18;HTGS 
I phase 1 



4.40E-17I U97553_59 



LOW-DENSITY 
LIPOPROTEIN 
RECEPTOR- 
RELATED PROTEIN 

1 PRECURSOR (LRP) 

(ALPHA-2- 
MACROGLOBULIN 
RECEPTOR) (A2MR) 
(APOLIPOPROTEIN 

E RECEPTOR) 
(APOER)>PIR2:S0239 

2 LDL receptor-related 
protein precursor - 
human>GP :H SLDLRR 
L_l Human mRNA for 

LDL-recept 

Murine herpesvirus 68 
strain WUMS, 
complete genome; 
Ribonucleotide 
reductase large 



Homo sapiens 
LTR HERV-K 
repetitive element 
fragment 
ltr_19_9a 
sequence. 



3.90E-18 I D84255J 



Ovophis okinavensis 
mitochondrial DNA for 
NADH dehydrogenase 
subunit 1, partial cds, I 
lle-tRNA, Pro-tRNA, 
Phe-tRNA,Gln- tRNA, 
Met-tRNA and control 
region (D-loop region); 
This cds 



0.06 



0.007 
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ID 



332 



Nearest 
Neighbor 
(BlastN vs. 
Gen bank) 



ACCESSION 



B36252 



DESCRIPTION 



HS-1038-A2- 
G01-MR.abi CIT 
Human Genomic 
Sperm Library C 
Homo sapiens 
genomic clone 
Plate=CT 820 
CoI=2 Row=M. 



P 

VALUE 



3.10E-18 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



ACCESSION 



PGBM_MOU 
SE 



DESCRIPTION 



333 



D78255 



334 I AC003046 



I Mouse mRNA for 
PAP-1, complete 

|cds 

Human Xp22 
PACsRPCll- 
263P4and 
RPC11-164K3 
complete 
sequence. 



335 I AC003002 I Human DNA 

from overlapping 
chromosome 19- 
specific cosmids 
R29515and 
R28253, genomic 
sequence, 
complete 
[sequence. 



336 



337 



Y15054 IRattus norvegicus 
mRNA for 70 
kDa tumor 
specific antigen, 
J partial 

Z97876 I Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
295C6; HTGS 



2.70E-18 I MUSPAP1 1 



BASEMENT 
MEMBRANE- 
SPECIFIC HEPARAN 
SULFATE 
PROTEOGLYCAN 
CORE PROTEIN 
PRECURSOR (HSPG) 
(PERLECAN) 
(PLC)>PIR2:S 18252 
heparan sulfate 
proteoglycan - 
mouse>GP:MUSPERP 
A_l Mouse perlecan 
mRNA, complete cds 



P 

VALUE 



0.00015 



I.40E-18I CEC34F6 1 



1.40E-18I MUSZFP0 1 



I Mouse mRNA for 
PAP-1, complete cds 

I Caenorhabditis elegans 
cosmid C34F6; 
C34F6;1;CDNA EST 
yk46bl2;5 comes from 
this gene; cDNA EST 
yk44c4;5 comes from 
this gene; cDNA EST 
yk46b!2;3 comes from 
I this gene 



I Mouse mRNA for zinc 
finger protein, partial 
sequence 



3.50E-I8 



0.0015 



1.30E-19 



3.40E-19I HS4U2IR2 1 



1.30E-19I AF003535 1 



Epstein-Barr virus 
(AG876 isolate) U2- 
IR2 domain encoding 
nuclear protein 
EBNA2, complete cds; 
Nuclear antigen 2 
I Homo sapiens LI 
[element ORF2-like 
protein gene, partial cds 



2.00E-06 
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SEQ 
ID 



I Nearest 
I Neighbor 
|(BlastN vs. 
iGenbank) 



ACCESSION 



M97159 



phase 1 . 



Mouse (clone 
pIL2)Bl 
dispersed repeat 
unit. 



339 



U30817 



DESCRIPTION 



P 

VALUE 



I Nearest 

1 Neighbor 
(BlastX vs. 

1 Non- 
Redundant 

1 Proteins) 
ACCESSION 



1.10E-19 1 A26882 



Bos taurus very- 
long-chain acyl- 
CoA 

dehydrogenase 
mRNA, nuclear 
gene encoding 
mitochondrial 
protein, complete 
cds. 



4.70E-20 I ACDV_RA1 



340 



Y11535 



341 I AL008730 



342 



U96629 



DESCRIPTION 



pIL2 hypothetical 
protein - rat 
(fragment)>GP:RATT 
DR_1 Rat growth and 
transformation- 
dependent mRNA, 3* 
end; Growth and 
transformation 
dependent protein 



H.sapiens mRNA 
for SHOXb 
protein. 



2.80E-20 |ALU1_HUM 
AN 



ACYL-COA 
DEHYDROGENASE, 
VERY-LONG-CHAIN 
SPECIFIC 
PRECURSOR (EC 
1.3.99.-) 

(VLCAD)>PIR2:A548 
72 acyl-CoA 
dehydrogenase (EC 
1.3.99.-) very-long- 
chain-specific 
precursor - 

rat>GP : RA T VLC AD_ 
1 Rat mRNA for very- 
,ong-chain Acyl-CoA 
dehydrogenase, compl 
!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 

Mil 



P 

VALUE 



0.2 



8.10E-25 



Human DNA 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
487 J7; HTGS 
phase 1. 



7.10E-21 I C40201 



Human 

chromosome 8 
BAC clone 
CIT987SK-2A8 
complete 
sequence. 



5.30E-23 



IALU1JHUM 
AN 



artifact-warning 
sequence (translated 
ALU class C) - human 



!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 



mi 



0.00027 



0.001 



3.80E-10 
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Table 2 
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Nearest 
Neighbor 
(BlastN vs. 
Gen bank) 


Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


A^^jLMiuxNj UJfcSCKIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


343 


U95743 1 Homo sapiens 
chromosome 16 
BAC clone 
CIT987-SK65D3, 
complete 
sequence. 


2.10E-24 


UROM HUM 
AN 


UROMODULIN 

PRECURSOR 

(TAMM-HORSFALL 

URINARY 

GLYCOPROTEIN) 

(THP)>PIR2:A30452 

uromodulin precursor - 

human>GP:HUMUMO 

D_l Human 


1 



344 



U 15972 



Mus musculus 
homeobox 
(Hoxa7)gene, 
complete cds. 



I 4.00E-25 I S20790 



U 15972 



Mus musculus 
homeobox 
(Hoxa7)gene, 
complete cds. 



4.00E-25 



|CA24_CAEE 
L 



Horsfall glycoprotein) 
mRNA, complete cds; 
Uromodulin precursor 



extensin 
almond>GP:PAEXTS_ 
1 P;amygdalus mRNA 
for extensin 



0.34 



3 46| Z66242 |H.sapiens CpG 
island DNA 
genomic Msel 
fragment, clone 
84a4, reverse read | 
cpg84a4.rtla. 



4.80E-26 I CEC35A5_8 



COLLAGEN ALPHA 
2(IV) CHAIN 
PRECURSOR>GP:CE 
COLA2IV_2 C;elegans| 
a2(IV) collagen gene; 
Alternatively spliced 
transcript 



0.1 



347 



L25331 Rattus 



norvegicus j 3.90E-26 
iysyl hydroxylase 
mRNA, complete 
cds. 



|LYSH_CHIC 
K 



228 



Caenorhabditis elegans 
cosmid C35A5, 
complete sequence; 
C35A5;8; CDNA EST 
yk31f6;5 comes from 
this gene; cDNA EST 
yk38hl;3 comes from 
this gene; cDNA EST 
yk38hl;5 comes from 
this gene; 



7.70E-19 



PROCOLLAGEN- 
LYSINE,2- 
OXOGLUTARATE 5- 
DIOXYGENASE 
PRECURSOR (EC 
1.14.1 1.4) (LYSYL 
H YDROX YLA SE)>PI 
R2:A23742 
procollagen-lysine 5- 
dioxygenase (EC 
1.14.11.4) precursor - 
chicken>GP;CHKLYH 
.1 Chicken lysyl 



1.10E-43 



BNSDOCID: <WO 9933982A2_L> 



PCT/US98/27610 



WO 99/33982 



I Nearest 
I Neighbor 
|(BlastN vs. 
iGenbank) 



Table 2 



"SEQ 
ID 



ACCESSiONl DESCKlKi iON 



348 I L81569 



349 I U78082 



Drosophila 
melanogaster 
(subclone 2_d7 
from PI DS04260 
(D68))DNA 
sequence, 
complete 
sequence 
I Human RNA 
polymerase 
transcriptional 
regulation 
mediator (h- 
MED6) mRNA, 
complete cds. 



P 

VALUE 



(Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 

1 Proteins) 
ACCESSION 



DESCRIFHOIN 



P 

VALUE 



I hydroxylase mRNA, 
complete cds 



3.30E-26 I CELC52B9__2 



350 I U43381 



Human Down 
Syndrome region 
of chromosome 
21DNA. 



351 I D50416 



Mouse mRNA tor 
AREC3, 
I complete cds. 



'2.30E-26 I HSU78082_1 



Caenorhabditis elegans 
cosmid C52B9; Coded 
for by C; elegans 
cDNA cml ld6; weakly | 
similar to S;cervisiae 
PTM1 precursor 
(SP:P32857) 

I Human RNA 
I polymerase 

transcriptional 

regulation mediator (h- 

MED6) mRNA, 

complete cds; H- 
I Med6p 



2.10E-28 IHSMRNAEB^ 
1 



H;sapiens genomic 
DNA, integration site 
for Epstein-Barr virus; 
Hypothetical protein 



0.18 



2.50E-29 1 A29947 



prostaglandm- 
endoperoxide synthase 
(EC 1.14.99.1) 
precursor - 

sheep>GP:SHPCOXA_ | 
1 Sheep prostaglandin 
endoperoxide 
synthetase 
(cyclooxygenase), 
complete cds; 
Cyclooxygenase 
precursor (EC 
11;14;99;1) 



0.81 
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354 



L09604 



iium cione 
C18D11;HTGS 
phase 1. 



Homo sapiens 
differentiation- 
dependent A4 
protein mRNA, 
complete cds. 



355 



I AF001905 



HS-1055-B2 
G03-MR.abi CIT 
Human Genomic 
Sperm Library C 
Homo sapiens 
genomic clone 
Piate=CT 777 
I Co\=6 Row=N. 

Homo sapiens 
cosmids E079, 
B0920 and A8 
from Xq25 X- 
linked 

lymphoproliferati 
ve disease gene 
candidate region, 

j complete 

[sequence. 



3.70E-32 I PVU72769J 



1.30E-32|CELT05H4_8 



1.80E-33I 138344 



Crithidia fasciculata 
fully edited ATPase 
subunit 6 (MURF4) 
mRNA, partial cds; 
Cryptogene 

S;pombe chromosome I 
cosmidclB3; 
Hypothetical protein; 
SPAC1B3;05 ; probable 
transcriptional 
regulator, Jen:630aa, 
similar eg; to YIL038C,| 
NOT3_YEAST 9 
P06 102, general 
negative regulator, 
Phaseolus vulgaris 
PvPRP-12(Pvpipl-12) 
mRNA, partial cds; 
Similar to cell wall 
proline rich 
protein>GP:PVU72769 
_1 Phaseolus vulgaris 
PvPRP-I2(Pvprpl-12) 
mRNA, partial cds; 
Similar to cell wall 
proline rich protein 
Caenorhabditis elegans 
cosmid T05H4; Similar 
to the beta transducin 
family; coded for by C; 
elegans cDNA 
ykl56ell;3; coded for 
by C; elegans cDNA 
ykl4c8;3; coded for by 
C; elegans cDNA 
titin - human 



0.00049 
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Table 2 



SEQ 
ID 



Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



ACCESSION I DESCRIFI1UIN 



P 

VALUE 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 
ACCESSION I 



DESCRIPTION 



357 



E03743 



358 I U31199 



359 I D14678 



360 I AB000425 



I DN A sequence 

including male 

hormone 

dependent gene 

I derived from 

hamster 
frankorgan. 
Human laminin 
gamma2 chain 
gene (LAMC2), 
exon 22 and 
flanking 

I sequences, 

I Human mRNA 
for kinesin 
related protein, 

1 partial cds 

Porcine DNAtor 
endopeptidase 
24.16, exon 16 
and complete cds. 



1.10E-34|CELC0JA7_2 



iCaenorhabditis elegans 
cosmid C03A7; Weak 
similarity to serotonin 
receptors 



1.20E-35I B44018 



2.00E-36 1 D49544_l 



laminin B2t chain 
human>GP:HSLAMB2 
TB_1 H;sapiens mRNA 
for laminin 



Mouse mRNA tor 
KIFC1, complete cds 



P 

VALUE 



0.59 



1.20E-14 



8.20E-38 |POL4_DROM 
E 



[RETROVIRUS^ 
RELATED POL 
POLYPROTEIN 
(PROTEASE (EC 
3.4.23.-); REVERSE 
TRANSCRIPTASE 
(EC 2.7.7.49); 
ENDONUCLEASE) 
(TRANSPOSON 
412)>PIR1:GNFF42 
retrovirus-related pol 
polyprotein - fruit fly 
(Drosophila 
melanogaster) 
transposon 
412>GP:DMRT412G_ 

4 



"361 | U39875 I Rattus norvegicus 
EF-hand Ca2+- 
binding protein 
p22 mRNA, 
complete cds. 



' 8 80E-42 I 156333 apolipoprotein B - rax 

(fragment)>GP:RATA 
POLPB_l Rattus 
norvegicus (clone 
rb9E) apolipoprotein B 
apoB mRNA, 3' end 



0.65 
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PCT/US98/27610 



:. :,: : : ^ ; f>ll 


Neighbor 
(BlastN vs. 
Genbank) 






Nearest 
Neighbor 
fBlastX vc 
Non- 

Redundant 

Proteins) 

ACCESSION 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


1 * 
J VALUE 


1 DESCRIPTION 


P 1 
VALUE 1 


362 


L09647 


Rattus norvegicus 
hepatocyte 
nuclear factor 3a 
(HNF-3 beta) 
mRNA, complete 
cds. 


6.60E-42 


HN3BRAT 


HEPATOCYTE 
NUCLEAR FACTOR 
3-BETA (HNF- 
3B)>GP:RATHNF3B_ 
1 Rattus norvegicus 
hepatocyte nuclear 
factor 3a (HNF-3 beta) 
mRNA, complete 
cds>TFD:TFDP01611 


8.10E-25 1 



363 



364 



"525538 (Human mRNA 
for KIAA0037 
gene, complete 

[cds. 

H.sapiens CpG 
island DNA 
genomic Msel 
fragment, clone 
13f7, reverse read j 
cpgl3f7.rtla. 



|CELC34D4J 
2 



factor HNF-3 (beta) 
Caenorhabditis elegans 
cosmid C34D4 



1.40E-43 I S75263 



366 I J05499 



AC002636 |*** 

SEQUENCING 
IN PROGRESS 
*** Drosophila 
melanogaster 
(subclone 2_g4 
from PI DS03323 
(D127))DNA 
sequence; HTGS 
phase 2. 

Rattus norvegicus 
L-glutamine 
amidohydrolase 
mRNA, complete 
cds. 



hypothetical protein - 
Synechocystis sp. (PCC 
6803)>GP:D90904_29 
Synechocystis sp; 
PCC6803 complete 
genome, 6/27, 630555- 
781448; Hypothetical 
protein; 

ORFJD:sll0983 



8.40E-44 |DMU95760J Drosophila 

melanogaster 
strawberry notch (sno) 
mRNA, complete cds; 
Notch pathway 
component; nuclear 
protein 



0.018 



0.0028 



8.00E-44I GLSL_RAT 



ULUTAMINASE, 
LIVER ISOFORM 
I PRECURSOR (EC 
3.5.1.2) 

(GLS)>GP:RATGAH_ 
1 Rattus norvegicus L- 
glutamine 
amidohydrolase 
| mRNA, complete cds 



3.40E-51 



8.00E-29 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 






SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 


367 


U95760 


Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
complete cds. 


5.00E-45 


DMU95760J 


Drosophila 
melanogaster 
strawberry notch (sno) 
mRNA, complete cds; 
Notch pathway 
component; nuclear 
protein 


A OAC AC 


368 


L10106 


Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
cds. 


4.10E-45 


PTPK HUMA 
N 


PROTEIN-TYROSINE 
PHOSPHATASE 
KAPPA PRECURSOR 
(EC 3. 1.3.48) (R-PTP- 
KAPPA)>GP:HSPTPK 
AP_1 H;sapiens mRNA 
for phosphotyrosine 
phosphatase kappa; 
Human 

phosphotyrosine 
phosphatase kappa 


4.70E-16 


369 


D17218 


Human HepG2 3' 
region Mbol 
cDNA, clone 
hmd3g02m3. 


9.40E-47 


MMU53563J 


Mus musculus Brgl 
mRNA, partial cds; N- 
terminal region of the 
protein 


0.00012 


370 


U78310 


Homo sapiens 
pescadillo 
mRNA, complete 
cds. 


8.10E-48 


HSU78310_1 


Homo sapiens 
pescadillo mRNA, 
complete cds 


1.10E-21 


371 


AC000399 


Genomic 
sequence from 
Mouse 9, 
complete 
sequence. 


7.40E-48 


KIP2_YEAST 


KINESIN-LIKE 
PROTEIN 
KIP2>PIR1:C42640 
kinesin-related protein 
KIP2 - yeast 
(Saccharomyces 
cerevisiae)>GP:SCKIP 
2XVI_2 S;cerevisiae 
PEP4 and KIP2 genes 
encoding PEP4 
proteinase (partial) and 
kinesin-related protein 
KIP2>GP:SCLACHX 
Vl_17S;cerev 


0.14 


372 


AC002327 


♦ ♦ ♦ 

SEQUENCING 
IN PROGRESS 
*** Genomic 
sequence from 
Mouse 7; HTGS 
phase 1, 3 


1 A ftC A Q 

1 ,4Ub-4o 


CriNA- 1 AZU D 
1 


r^liirlfpn tvoe-1 

collagen; amino acids - 
16 to 3; Procollagen 
alpha-2 


0.024 



233 
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Table 2 



. ''■':'.:'y 


Nearest 
Neighbor 

Genbank) 




Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P 

VALUE 


ACCESSION 


DESCRIPTION 


P 

VALUE 






unordered pieces. 










373 


X67016 


H.sapiens mRNA 
for amphiglycan. 


9.00E-49 


CED2085_2 


Caenorhabditis elegans 
cosmid D2085, 
complete sequence; 
D2085;l; Similar to 


0.14 



374 



LI 0409 



Mouse fork head | 1.50E-49 
related protein 
(HNF-3beta) 
mRNA, complete 
cds. 



carbamoyl-phosphate 
synthase, aspartate 
carbamoyltransferase, 
dihydroorotase; cDNA 
EST 

cm 1 6f3>GP:CED2085 
I 2 Caenorhabditis 
elegans cosmid D2085; 
p 

|MMU04197_1 Mus musculus HNF3 rL20E^30~ 
beta transcription factor | 
(HNF3b) mRNA, 
partial cds; Sequence of| 
this partial cDNA 
begins in the first third 
of the conserved 
HNF3/forkhead DNA 
| binding domain 



U01139 



Mus musculus 
B6D2F1 clone 
2C1 IB mRNA. 



1.20m* I SFBC3D5_14 | S;pombe chromosome 

II cosmid c3D5; 
Unknown; 
SPBC3D5;14c, 
unknown; partial; 
serine rich, len:309aa, 
similar eg; to 
YNL283C, 
YN23_YEAST, 
P53832, hypothetical 
52;3 kd protein, 
(503aa), 



0.00091 



Human DNA 
sequence from 
PAC 326L13 
containing brain- 
4 mRNA ESTs 
and polymorphic 
CA repeat. 



9.00E-50 I BSU55043 3 



J Bacillus subtilis 
plasmid pPOD2000 
Rep, RapAB, RapA, 
ParA, ParB, and ParC 
genes, complete cds; 
ORF3 



0.025 
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[Nearest 
1 Neighbor 
|(BlastN vs. 
1 Gen bank) 



(Nearest 
I Neighbor 
|(BlastX vs. 
|Non- 

I Redundant 
I Proteins) 



I ACCESSION I DESCRIFI ION 



P 

VALUE 



Z99289 



| Human DN A 
sequence *** 
SEQUENCING 
IN PROGRESS 
*** from clone 
142L7; HTGS 
phase 1. 



I ACCESSION 



DESCRIPTION 



70E-50 



A64431 



hypothetical protein 
MJ1050- 
Methanococcus 
jannaschii>GP:MJU67 
548_2 Methanococcus 
jannaschii from bases 
986219 to 996377 
(section 90 of 150) of 
the complete genome; 
M; jannaschii predicted 
coding region MJ1050; 
Identified by 
GeneMark; putativ 



P 

VALUE 



5.60E-05 



X98260 



H.sapiens mRNA 
for M-phase 
phosphoprotein, 
mppll. 



6.20E-50 IZRF1J 



MOUS 
E 



ZUOTIN RELATED 
FACTOR>GP:MMU53 
208_1 Mus musculus 
zuotin related factor 
(ZRFl)mRNA, 
complete cds; Similar 
to DnaJ encoded by 
GenBank Accession 
Number L16953 



M18981 



I Human prolactin 
receptor- 
associated protein 
(PRA)gene, 
complete cds. 



9.00E-52 



|S106_HUMA 
N 



CALCYCLIN 
(PROLACTIN 
RECEPTOR 
ASSOCIATED 
PROTEIN) (PRA) 
(GROWTH FACTOR- 
INDUCIBLE 
PROTEIN 2A9) (S100 
CALCIUM-BINDING 
PROTEIN 
A6)>PIR1:BCHUY 
calcyclin - 

human>GP:HUMCAC 
Y_l Human calcyclin 
gene, complete 
cds>GP:HUMCACYA 
_1 Human prolactin 
recept 



380 I AB006622 



Homo sapiens 
mRNA for 
KIAA0284 gene : 
1 partial cds. 



1.60E-53 



S33015 



hypothetical protein - 
human herpesvirus 4 
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383 I D87450 



Human sorting 
nexin 1 (SNX1) 
mRNA, complete 
cds. 

Human DNA 
sequence from 
PAC 435C23 on 
chromosome X. 
Contains ESTs. 
Human mRNA 
forKIAA0261 
gene, partial cds. 



sorting nexin 1 
human>GP:HSU53225 
_1 Human sorting 
nexin 1 (SNX1) 
mRNA ? complete cds 



384 I AC002301 *** 



4J0E-66I D87450 l' 



385 I L29766 



386 I U58884 



SEQUENCING 
IN PROGRESS 
*** Human 
chromosome 
+16pll.2 BAC 
clone CIT987SK- 
A-328A3;HTGS 
phase 2, 1 
[ordered pieces. 



9.80E-57 1 S62328 



Lentinus edodes 
Le;MFBl mRNA, 
complete cds 



Human mRNA for 
KIAA0261 gene, 
partial cds; Similar to 
D;melanogaster parallel 
sister chromatids 
[protein 



9.20E-50 



kinesin-like DNA 
binding protein KID - 
human>GP:HUMKID_ 
1 Human mRNA for 
Kid (kinesin-like DNA 
binding protein), 
complete cds 



4.30E-30 



2.60E-27 I 



| Homo sapiens 
epoxide hydrolase 
(EPHX)gene, 
complete cds. 
Mus musculus 
SH3-containing 
protein SH3P7 
mRNA, complete 
cds. similar to 
Human Drebrin. 



(HSBC1CF4J Homo sapiens mRNA' 
for hTCF-4 



J.JUJti-58 ImMU58884 f 



387 I Y15054 



Rattus norvegicus 
mRNA for 70 
kDa tumor 
specific antigen, 
partial. 



Mus musculus SH3 
containing protein 
SH3P7 mRNA, 
complete cds; similar to 
Human Drebrin; SH3- 
containing protein; 
similar to human 
drebrin 



KNY 15054_1 Rattus norvegicus 
mRNA for 70 kDa 
tumor specific antigen, 
partial; 70 kD tumor- 
| specific antigen 



2.30E-05 



4.70E-45 
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I Nearest 
I Neighbor 
|(BlastN vs. 
I Gen bank) 



Table 2 



SEQ 
ID 



I ACCESSION 



DESCRIPTION 



P 

VALUE 



388 I AC000406 



SEQUENCING 
IN PROGRESS 

** Human 
Chromosome 1 1 
overlapping pacs 
pDJ235kl0and 
P DJ239b22; 
HTGS phase 1, 
17 unordered 
pieces. 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 



ACCESSION 



.40E-59 1 <NONE> 



DESCRIPTION 



<NONE> 



P 

VALUE 

<NONE> I 



389 



L42612 



Homo sapiens 
keratin 6 isoform 
K6f (KRT6F) 
mRNA, complete 
cds. 



.60E-59 1 KRHUEA 



keratin, type II 
cytoskeletal - human 
(fragment)>GP:HSKE 
RA_1 Human 
messenger fragment 
encoding cytoskeletal 
keratin (type II); 
mRNA from cultured 
epidermal cells from 
human 

foreskin>GP:HUMKE 
R56KJ Human 56k 
cytoskeletal type II 
keratin mRNA _ 



390 I L29766 



Homo sapiens 
epoxide hydrolase 
(EPHX) gene, 
complete cds. 



2.70E-60 



|EGR2_HUMA 
N 



391 I L08758 



Mus musculus 
homeobox protein 
(HoxAlO)gene, 
5' end of cds, 



1.40E-60 



IPAALGYGE 
N 1 



EARLY GROWTH I 7.80E-06 
RESPONSE PROTEIN 
2 (EGR-2) (KROX-20 
PROTEIN) 

(AT591)>GP:HUMEG 
R2A_1 Human early 
growth response 2 
protein (EGR2) 
mRNA, complete 
cds>TFD:TFDP00485 
- Polypeptides entry for 

factor Egr-2 

P;aeruginosa algY I 0.00031 
gene; Alginate lyase 



392 I 129058 



Sequence 3 from 
patent US 
5576423. 



4.20E-61 | JC5106 stromal cell-derived 

factor 2 - 

human>GP:D50645_ 
Human mRNA for 
SDF2, complete cds; 
Stroma cell-derived 



1 
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~394~1 U46067 



395 I U40747 



Capra hircus 
beta-mannosidase I 
mRNA, complete 
cds. 



i.yut>62|CHU46067j" 



stromal cell-derived 
factor 2 - 

human>GP:D50645_ 
Human mRNA for 
SDF2, complete cds; 
Stroma cell-derived 
factor-2 

Capra hircus beta- 
mannosidase mRNA, 
complete cds 



I.50E-32 



Mus musculus 
formin binding 
protein 1 1 
mRNA, partial 
cds. 



I 6.90E-63 I S64713 



396 



M36164 



Human 

glyceraIdehyde-3- J 
phosphate 
J dehydrogenase 
mRNA, 3' flank. 



1.10^-63 I BHT1UL 12 



397 



Y09036 



H.sapiens 
NTRK1 gene, 
exon 17. 



/.JUE-65 ImMU39060J 



tbrmin binding protein 3.00E-46 
1 1 - mouse ' 

(fragment)>GP:MMU4 

0747J Mus musculus 

formin binding protein 

1 1 mRNA, partial cds; 

FBP 11; Formin 

binding protein 11; 

tandem WWP/WW 

domains separated by 

15 amino acid linker 

Bovine herpesvirus | 0.003 

type 1 UL22-35 genes; 

UL26;5>GP:BHU3180 , 
9_2 Bovine herpesvirus 

1 maturational 
proteinase (UL26) 
gene, complete cds, and 
scaffold protein 
(UL26;5)gene, 
complete cds 



238 



Mus musculus 
glucocorticoid receptor 
interacting protein 1 
(GRIP 1) mRNA, 
complete cds; 
Hormone-dependent 
interaction with 
hormone binding 
domains of steroid 
receptors; 



0.0054 



BNSDOCID: <W0_9933982A2J > 



WO 99/33982 



Table 2 



PCTAJS98/27610 



I Nearest 
1 Neighbor 
l(BlastNvs. 
* iGenbank) 



I Nearest 
I Neighbor 
|(BlastX vs. 
|Non- 

1 Redundant 
[Proteins) 



SEQ 
ID 



I ACCESSION 



398 



U17901 



DESCRIPTION 



P 

VALUE 



I ACCESSION 



transactivation 



Rattus norvegicus 
phospholipase A- 
2-activating 
protein (plap) 
mRNA, complete 
cds. 



399 



D 12646 



Mouse kit4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds. 



2.70E-70 1 JC4239 



1.70E-74 



|KIF4_MOUS 
E 



400 I AF007SbU 



401 



145565 



Xenopus laevis 
xl-Mago mRNA, 
I complete cds. 



Sequence 15 from 
patent US 
5637463. 



4.60E-75 I AF007862J 



DESCRIPTION 



P 

VALUE 



phospholipase A2- 
activating protein - rat 



8.40E-17 



2.30E-82 I RNU57391J 



K1NESIN-LIKE 
PROTEIN 
KIF4>PIR2:A54803 
microtubule-associated 

motor KIF4 - 
mouse>GP:MUSKIF4 - 
1 Mouse kif4 mRNA 
for microtubule-based 
motor protein KIF4, 
complete cds; ATP- 
binding site: base980- 
1037, motor domain: 
base732-1781,alpha- 

helical co 

Mus musculus mm 
Mago mRNA, 
complete cds; Similar 
to Drosophila 
melanogaster Mago 

protein 

Rattus norvegicus 
FceRI gamma-chain 
interacting protein 
SH2-B(SH2-B) 
mRNA, complete cds; 
Putative FceRI gamma 
IT AM interacting 
protein; SH2 domain- 
containing protein B; 
Method: conceptual 



1.10E-44 



6.50E-68 
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SEQ 
ID 



402 



403 



Nearest 
Neighbor 
(BlastN vs. 
Genbank) 



ACCESSION 



U29156 



U70139 



DESCRIPTION 



Mus musculus 
epsl5RmRNA, 
complete cds. 



404 



U82626 



Mus musculus 

niitntivA PPD/i 

I — • — wvi\T 

protein mRNA, 
partial cds. 



Rattus norvegicus 
basement 
membrane- 
associated 
chondroitin 
proteoglycan 
Bamacan mRNA, 
complete cds. 



P 

VALUE 



L00E-85 



1.00E-85 



Nearest 
Neighbor 
(BlastX vs. 
Non- 
Redundant 
Proteins) 
ACCESSION 



MMU29 156 J 



MMU70139J 



DESCRIPTION 



Mus musculus epslSR 
mRNA, complete cds; 
Involved in signaling 
by the epidermal 
growth factor receptor; 
Method; conceptual 
translation supplied by 
author 



7.60E-96 



Mus musculus putative 
CCR4 protein mRNA, 
partial cds; Similar to 
yeast transcription 
factor CCR4; 
transcriptional 
readthrough occurs 
with transcription being 
initiated at the IAP and 
continues 
Rattus norvegicus 
basement membrane- 
associated chondroitin 
proteoglycan Bamacan 
mRNA, complete cds; 
Chondroitin sulfate 
proteoglycan; CSPG 



4.90E-62 



7.20E-66 



8.20E-58 



>: <WO_9933982A2J_> 
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Table 2 



^phhnr miastN vs. Gen bank) i Nearest Neighbo r (BlastX vs. Non-Kedunda nt Proteins), 
Ne arest Neign i p VALUE I ACCESSION I DESCRIP TION 1 P VALUE. 



DESCRIPTION 



4051 L09604 



406 AB000516 



Homo sapiens 
differentiation- 
dependent A4 
protein mRNA, 
complete cds. 



Homo sapiens 
mRNA for DSIF 
pi 60, complete 
cds 



2.00E-35 



<NONE> 



<NONE> 



TT\ POLGJTUMVQ 



4071 Z94753 



Human DNA 
sequence from 
PAC 465G10on 
chromosome X 
contains Menkes 
Disease (ATP7A) 
putative Cu++- 
transporting P- 
type ATPase 
exons 22, 23 and 
STS 



0.004 



<NONE> 



GENOME 
POLYPROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER 
COMPONENT 
PROTEINASE (EC 
3.4.22.-) (HC-PRO); 
42-50 KD 
PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 
KD PROTEIN; VPG 
PROTEIN; 
NUCLEAR 
INCLUSION 
lPRO TEIN A (NI-A) 
<NONE> 



<NONE> 



<NONE> 



408 I AB011123 



409| D17218 



4ToT M95098 



Homo sapiens 
mRNA for 
K1AA0551 
protein, partial 
cds 

Human HepG2 3' 
region Mbol 
cDNA, clone 
hmd3g02m3 
Bos taurus 
lysozyme gene 
(cow 2), complete 
cds 



Ml 15 CAEEL 



Tl23 NARGJBACSU 



1.1 



HAIR MOUSE 



IQ23356 
caenorhabditis 
elegans. 

serine/threonine- 
Iprotein kinase mig 
15(ec 2.7.1.-) i n/9 8 
NITRATE ~ 
REDUCTASE 
ALPHA CHAIN (EC 
1.7.99.4) 
HAIRLESS 
PROTEIN 



2.00E-51 



9.9 



8.00E-10 
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Table 2 



Nearest Neighbor (BJastN vs. G^ntanB 



stg ACCESSION 
ID 



DESCR1PJJON I P VALUE ACCEslo^ 



H.sapiens CpG 
DNA, clone 
1 87a9, reverse 
read 

cpgl87a9.rtla. 
P.magnus gene 
for protein urPAB 

Notophthalmus 
viridescens 
mRNA for 
cytokeratin 8 



hgret Me.ghbor (Blasts vs. Non-Redun dlr^PT^T 
—Tumu*, i DES CRIPUUN | P VALUE 



4.00E-54 



<us I AB007157 



416 I X86340 



Xenopus laevis 
(clonepXECl.3) 
C protein mRNA, 
complete cds. 
iHomo sapiens 
gene for 

ribosomal protein 
S21, partial cds 
H.sapiens C7 
[gene, exon 13 

lHumanCsa-19 
mRNA, complete 
cds. 



U.U14 I YPT2_CAEEL~ 



°- 3 7 I VA53_SCHPO~ 



HEPATDCYTE 
NUCLEAR 
FACTOR 3-BETA 
(HNF-3B) 

HYPOTHETICAL 
21.6 KD PROTEIN 
F37A4.2 IN 

CHROMOSOME III 
HVPOTHETICAL 
24.2 KD PROTEIN 
CI3A1I.03IN 



037 [' PDR5_YEAST 



run rw ^/-xo. y-v» _ 



4.00E-21 



e-142 



SUPPRESSOR OF 
TOXICITY OF 
SPORIDESMIN 

Kara INHUMAN 40S RIBOSOMAL ' 
PROTEIN S21 



3.3 



4.3 



418 | U95102 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



8.00E-08 



a i V_DROMB SHU] TLE CRAFT" 

[PROTEIN 

R10A_PIG 60S RIBOSOMAL I 9 00E-57 
PROTEIN L10A 
(CSA-19) 
(FRAGMENT) 



<NONE> 



<NONE> <NONE>' 



419 1 M8QI98 |Human FKBP- 12 I 5.00E-14 
pseudogene, clone 
|lambda-512, 5* 
flank and 
[complete cds. 



^uijsiEUCR 1RANSCRIPTIONA 0 008 
L REPRESSOR 
RCO-1 



W AF052573 



«i AF035940 



(Homo sapiens 
DNA polymerase 
eta (POLH) 
mRNA, complete 
cds 



0 



<NONE> 



<NONE> f <NONE>~ 



422 AF054994 



Homo sapiens 
MAGOH mRNA, , 
complete cds 



e-131 



mun_droml Imago nashi" 

PROTEIN 



4.00E-39 



Homo sapiens 
clone 23832 
JmRNA sequence 



0.12 



<NONE> 



<NONE> <NONE> 
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Table 2 



(SEQ 
ID 



Neare st Neighbor (BlastN vs. Genbank) 
ACCESSION | DESCRIPTION 



"4231 095098 



"424| U95094 



"425| D43952 



"4261 X68553 



427 M83314 



[Nearest Ne ighbor (BlastX vs. Non-Redundant Proteins)! 
P VALUE I ACCESSION 1 DESCRIPTION 1 P VALUE ' 



428 I AF070636 



429 <NONE> 



iXenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 

cds 

Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 

|cds 

Mouse gene tor 
reticulocalbin, 
exonl and 
promoter region 
C.elegans 
repetitive DNA 
sequence 



Tomato 
phenylalanine 
ammonia lyase 
(pal) gene, 
complete cds and 
promoter region. 
Homo sapiens 
clone 24686 
lmRNA sequence 
<NONE> 



6.00E-05 



430 AF068627 



431 AF020043 



Mus musculus 
DNA cytosine-5 
methyltransferase 
3B2 (Dnmt3b) 
mRNA, 
alternatively 
spliced, complete 

|cds 

Homo sapiens 
chromosome- 
I associated 
polypeptide 



0.36 



0.4 



3.3 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



TCB1 RABIT T-CELL 

RECEPTOR BETA 

CHAIN 

PRECURSOR (ANA | 

m 

SMB2 HUMAN DNA-BINUINU 

PROTEIN SMUBP-2 
(GLIAL FACTOR-l)| 
(OF-1) 



0.11 



0.65 



<NONE> 



<NONE> 



5.00E-04 



IQGA HUMAN RAS UirASt- 
ACTIVATING- 
LIKE PROTEIN 
IQGAP1 (P195) 

LOXl^LENCU LIPOXYGENASE 



2.00E-06 



9.9 



(EC 1.13.11.12) 



0 



YJH4 YEAST HYPOTHETICAL 4.UUi>io 
141.3 KD PROTEIN 1 
IN SCP160-MRPL8 
INTERGENIC 
REGION 



432 | K00046 Iross river virus 
26s subgenomic 
rna and junction 
|region. 



0.12 



CUL2_HUMAN CULL1N 

HOMOLOG 2 
(CUL-2) 



7.4 
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438 AF032873 



W<L AF009288 



Aquitex aeolicus 
section 82 of 109 
of the complete 
genome 
Dictyostelium 
discoideum V- 
ATPase A subunit 
(vatA) mRNA, 
complete cds 
Homo sapiens 
uncoupling 
protein 3 (UCP3) 
gene, exon 1 and 
partial exon 2 
Porcine DNA for 
endopeptidase 
24.16, exon 16 
and complete cds 

Mus musculus 11- 

zinc-finger 

transcription 

actor 

Homo sapiens 
ubiquitin 
conjugating 
enzyme G2 
Homo sapiens 
clone HEB8 Cri- 
du-chat region 
mRNA 



"^12 VCAP_HSV6U 



0-13 WEEl_SCHPO 



4.00E-32 <NONE> 



0.04 



^.UUfc-34 LMG1_HUMAN 



MAJOR CAPSID | JT 

PROTEIN (MCP) 



MITOSIS 1 37- 

INHIBITOR 
PROTEIN KINASE 
WEE 1 (EC 2.7.1..) 

<NONE> I <NONE> 



<NONE> <NONE> 



<NONE> f <NONE>" 



LAMININ 
GAMMA- 1 CHAIN 
PRECURSOR 
(LAMININ B2 
CHAIN) 



8.1 
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Table 2 



^, KI , a Nv..r,enba.*> j Nearest Neifelib or ( BlastX^s^No n-Reduntlan tP^^^) 



| SEQ I ACCESSION 
ID 



443 I AF02457S 



~444~1 M24486 



"445| X96400 
446 <NONE> 



"4471 X84996 



DESCRIPTION 



Homo sapiens 
type-1 protein 
phosphatase 
skeletal muscle 
glycogen 
targeting subunit 
(PPPlR3)gene, 
exon 4, and 
complete cds 



Human prolyl 4- 
hydroxylase alpha 
subunit mRNA, 
complete cds, 
clone PA-11. 



P VALUE ACCESSION 



1.1 



<NONE> 



<NONE> 



DACHA 



|<NONE> 



<NONE> 



4.00E-58 



.tetraurelia 
alpha-51Dgene 

<NONE> 



0.37 



<NONE> 



<NONE> 
<NONE> 



<NONE> 



<NONE> 



448 AF0199SU 



"4491 X78424 



450 <NONE> 



X.laevis mRNA 
for selenocysteine | 
tRNA acting 
factor (Staf) 



Dictyostelium 
discoideum ZipA 
(zipA) gene, 

partial cds 

D.carota (Queen 
Anne's Lace) 
Inv*Dc2 gene, 
34 32bp 

<NONE> 



0.12 



POL MLVRD 



"45T1 X89886 



"4521 U67471 



453 AF060246 



3.4 



0.38 



<NONE> 



HMDL_BRAFL 



<NONE> 



<NONE> 



1POL 

POLYPROTEIN 
(PROTEASE (EC 
3.4.23.-); REVERSE 
TRANSCRIPTASE 
(EC 2.7.7.49); 
RIBONUCLEASEH 
(EC 3.1.26.4)) 

homeobox 
protein dll 
Ihomolog 

<NONE> 



<NONE> 



P.patens mRNA 
for 5- 

aminolevulinate 
dehydratase 



1.1 



CKR6 HUMAN 



C-C CHEMOKJNfc 
RECEPTOR TYPE 6 
(C-C CKR-6) 
|(CCR6) 



Methanococcus 
jannaschii section | 
13 of 150 of the 
complete genome 

Mus musculus 
strain C57BL/6 
zinc finger protein | 
106 (Zfpl06) 
mRNA, H3a-a 
allele, complete 
cds 



0.12 



YR72 ECOLI 



1.00E-62 YOJ8_CAEEL 



HYPOTHETICAL 
53.2 KD PROTEIN 
(ORF2) (RETRON 
EC67) 

HYPOTHETICAL 
51.6 KD PROTEIN 
ZK353.8IN 
CHROMOSOME III 



<NONE> 



<NONE> 



2.00E-08 



0.23 



<NONE> 



<NONE> 
9.9 



5.8 



1.7 
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SEQ 
ID 



454 



455 



456 



458 



Nearest Neighbor (BlastN vs. GenbankV 
ACCESSION | DESCRIPTION I P VALUE 



U70667 



M95858 



U67594 



AUO/47 

Z65575 



Human Fas-liganc 
associated factor 
1 mRNA, partial 
cds 



Bos taurus 
recoverin mRNA, 
complete cds. 

Methanococcus 
jannaschii section 
136 of 150 of the 
complete genome 



459 I X88893 



460 M57426 



Human hnRNP 
core protein A 1 
H.sapiens CpG 
DNA, clone 47c5 ; 
reverse read 
cpg47c5.rtla 



0.36 



nearest Neighbor (BlastX vs. Non-Redundant Pr oteinO 



ACCESSION 



YKB2 YEAST 



GIDA_MYCGE 



<NONE> 



ASCRIPTION 



'HYPOTHETICAL 
69.1 KD PROTEIN 
IN PUT3-CCE1 
INTERGENIC 
REGION 



GLUCOSE 
INHIBITED 
DIVISION 
PROTEIN A 
<NONE> 



P VALUE 



3.00E-09 



461 I X01638 



462 JM60064" 



C.jacchus intron 4 
of visual pigment 

gene 

Maize stripe virus 
RNA3 
nonstructural 
protein 



Yeast TEF1 gene 
for elongation 
factor EF-1 alpha 



464 J L10106 



S.typhimurium 
glutamate 1- 
semialdehyde 
aminotransferase 
(hemL) gene, 
complete cds. 

Kabbit mRNA for 
aminopeptidase N 
(partial) 



3.00E-31 
73 



5.00E-15 



0.33 



1.1 



<NONE> 
<NONE> 



<NONE> 



USC2_MOUSE 



<NONE> 
<NONE> 



r^UL_DROME 



<NONE> 



DESMOCOlXiN" 
2A/2B 

PRECURSOR 
(EPITHELIAL 
TYPE 2 

DESMOCOLLIN) 



1.1 



Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
cds. 



0.36 



2.00E-58 



EPB4_MOUSE 



POLY(ADP- 
RIBOSE) 

POLYMERASE (EC 
2.4.2.30) (PARP) 1 



A CHGXENL A 



VG13_BPML5 



EPHR1N TYPE-B 
RECEPTOR 4 
PRECURSOR (EC 
2.7.1.1 12) KINASE 
2) (TYROSINE 
KINASE MYK- 1) 

ACETYLCHOLINE 
RECEPTOR 
PROTEIN, GAMMA 
CHAIN 
PRECURSOR 



GENE 13 PROTEIN 
(GP13) 



1.4 
<NONE> 

<NONE> 
<NONE> 



<NONE> 



6.5 



3.5 



2.5 



1.5 



2.5 
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Table 2 



SEQ 
ID 



Near est Neighbor (BlastN vs. Genbank) 
[ACCESSION | DESCRIPTION I P VALUE 



Nearest N eighbor (BlastX vs. Non-Redundant Proteins)! 
ACCESSION | DESCRIPTION 



4651 M77235 



4661 M58330 



4671 X51508 



4681 L10106 



(Human cardiac 
tetrodotoxin- 
insensitive 
voltage-dependent 
sodium channel 
alpha subunit 
(HHl)mRNA, 
complete cds. 
C.maltosa 
autonomously 
replicating 
sequence. 



Rabbit mRN A tor 
aminopeptidase N 
(partial) 



Mus musculus 
protein tyrosine 
phosphate 
mRNA, complete 
cds. 



3.8 



ZPBOC1 



<NONE> 



0.004 



EPB4 MOUSE 



0.35 



ACHG XENLA 



7.00E-59 VGLTPRVRT 



EPHRIN TYPE-B 
RECEPTOR 4 
PRECURSOR (EC 
2.7.1.112) KINASE 
2) (TYROSINE 
KINASE MYK- 1) 

ACETYLCHOLINE 
RECEPTOR 
PROTEIN, GAMMA 
CHAIN 
PRECURSOR 
GLYCOPROTEIN 
GP63 PRECURSOR 



P VALUE I 



4691 U65939 



Azotobacter 
vinelandii GTPase) 
(ftsA) gene, 
partial cds, and 
ATP binding 
protein (ftsZ) 
gene, complete 
cds 



1.1 



TRUA BACSP 



Q45557 bacillus sp. 
(strain ksm-64). trna 
pseudouridine 
synthase a (ec 
4.2.1.70) 
(pseudouridylate 
synthase i) 
(pseudouridine 
synthase i) (uracil 
hydrolyase). 11/98 



4701 U51037 



471 M32685 



Mus musculus 11- 
zinc-finger 
transcription 

factor 

Human platelet 
glycoprotein Ilia, 
exon 14. 



0.041 



3.6 



<NONE> 



<NONE> 



6.9 



2.4 



2.4 



4.3 



0.001 



<NONE> 



<NONE> 



<NONE> 
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U82691 




D85430 



Phrynocephalus 
raddei CAS 

1 79770 NADH 
dehydrogenase 
subunit 1 (ND1), 
partial cds, tRNA 
Gin, tRNA-Ue 
and tRNA-Met, 
NADH 

dehydrogenase 
subunit 2 tRNA- 
Cys and tRNA- 
Tyr and c, 



» ¥~ . 

iTiwuoc iViUITi j 0.12 

mRNA, exon 



<NONE> 



474 | U20661 



476 I U32843 



477 | U67554 



Uictyostelium | o36~ 

discoideum 

unknown internal 

repeat protein 

gene, complete 

cds, and unknown | 
Jorfl, orf2 and 
brO genes, partial , 
[cds 

Human novel 1 o.04 
homeoboxmRNA) 
for a DNA 
binding protein 

Haemophilus 
'influenzae Rd 
isection 158 of 1631 
of the complete 
[genome 



EPa^CHICK IhPHRlN TYPE-A 

RECEPTOR 5 

PRECURSOR (EC 
2.7.1.112) 

jH VPOTHETICAL 
BHLF1 PROTEIN 



<NONE> 



2.5 



*A5_HUMAN COAGULATION 
FACTOR V 
PRECURSOR 
(ACTIVATED 
PROTEIN C 
COFACTOR) 
<NONE> 



9.5 



4/8 AB004244 - 



[Methanococcus 
jannaschii section 
196 of 150 of the 
complete genome 

Narkejaponica 



0.36 



<NONE> 



<NONE> 



mRNA for Nj 
synaphin lb, 
complete cds 
Homo sapiens full 
length insert 
cDNA YQ80A08 



1.1 



MlAl^ORYSA NITRATE 

REDUCTASE 1 (EC 
1.6.6.1) (NR1) 



1.00E-12 <NONE>~ 



<NONE> 



1.00E-07 
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Table 2 



1 P VALUE ACCESSION DESCRIPTION P 



DESCRIPTION 



480 | AE000723 lAquifex aeolicus 
section 55 of 109 
of the complete 
genome 



"48T| X73902 



"4821 095094" 



H.sapiens mRNA 
for nicein B2 

[chain 

Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



483 AL01024U 



"4841 U49919 



485 AF077618 



486 AF054994 



"4871 U95102 



488 AF068627 



Plasmodium 
falciparum DNA 
*** 

SEQUENCING 
IN PROGRESS 
*** from contig 
4-64, complete 
sequence 
Arabidopsis 
thaliana lupeol 
synthase mRNA, 
[ complete cds 
Homo sapiens 
p73 gene, exon 3 

Homo sapiens 
clone 23832 
mRNA sequence 
Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
[complete cds 



YKK0_YEAST 



3.00E-10 



LMG2_HUMAN 



P53 CRIGR 



(HYPOTHETICAL 
67.5 KD PROTEIN 
IN APE1/LAP4- 
CWP1 

INTERGENIC 
REGION 
LAMININ 
GAMMA-2 CHAIN 
PRECURSOR 
CELLULAR 
TUMOR ANTIGEN 

P53 



1.2 



<NONE> 



"4891 U95102 



|Mus musculus 
|DNA cytosine-5 
methyltransferase 
3B2 (Dnmt3b) 
mRNA, 
alternatively 
spliced, complete 

]cds 

|Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
Icomplete cds 



"034 YA53_SCHPO 



Th39 MYOD_MOUSL 



0.13 



3.00E-10 



<NONE> 



HYPOTHETICAL 
24.2 KD PROTEIN 
C13A1L03IN 
CHROMOSOME I 
MYOBLAST 
DETERMINATION 
IPROTEIN 1 

<NONE> 



9.1 



3.00E-93 



5.7 



<NONE> 



5.00E-04 ACE2_YEAST 



MET ALLOTH ION E 
IN EXPRESSION 
ACTIVATOR 



3.00E-07 RIN1__P1G 



RIBONUCLEASL 
INHIBITOR 



6.00E-10 



2.1 



1.5 



0.19 
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Table 2 



. nearest Neighbor (Blas tN vs. Gii^kT 

StQ ACCESSION I DESCRIPTION 

ID 1 1 



490 I L77886 



492 U95094 



493 1 U95094 



Human protein 
tyrosine 
phosphatase 
mRNA, complete 
cds 

Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



P VALUE 



I.00E-21 



494 I U28153 



495 | U95094 



496 U95102 



497 I U95102 



498 I U95094 



499 J U95102 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 

mDXU „ I.,. 

in, bUHI^IICLC 

cds 

Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 

cds 

Caenorhabditis 
elegans UNC-76 
(unc-76) gene, 
complete cds. 
Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
[cds 

Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



8.00E-08 



nearest Neighbor (BlastX vs. Non- Kedundant ProteiT^ 
ACCESSION | DESCRIPTION | P VALUE" 



VS48_TBRVS 



CRP3_LIMPO 



EPA5_CHICK 



SATELLITE RNA 
48 KD PROTEIN 



C-REACTIVE 
PROTEIN 3.3 
PRECURSOR 



3.00E-09 <NONE> 



EPHRIN TYPE-A 
RECEPTOR 5 
PRECURSOR (F.C 
2.7.1.112) 



1.6 



3.5 



2.7 



0.37 



"037 I NCPR_YEAST 



0013 VMH3CAEEL 



<NONE> <NONE>' 



NADPH- 
CYTOCHROME 
P450 REDUCTASE 
(EC 1.6.2.4) (CPR) 



(Xenopus laevis 
mitotic 

jphosphoprotein 
90 mRNA, 
[complete cds 
Xenopus laevis 
XL-INCENP 
(XL-INCENP) 

ImRNA, complete 

[cds 

(Xenopus laevis 
[mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



7.00E-07 I <NONE> 



PROBABLE 
INTEGRIN ALPHA 
CHAIN F54G8.3 
PRECURSOR 



3.3 



<NONE> 



<NONE> 



Z00E-07 VGLY_LYCVW 



<NONE> <NONE> 



GLYCOPROTEIN 
POLYPROTEIN 
PRECURSOR 
(CONTAINS: 

GLYCOPROTEINS 
Gl AND G2) 



3.2 
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Table 2 



I SEQ | ACCESSION | 
ID 



DESCRIPTION 



P VALUE ACCESSION 



P VALUE 



500 I U95098 



(Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



8.00E-06 I HR78_DROMb 



NUCLEAR 

HORMONE 

RECEPTOR HR78 

(DHR78) 

(NUCLEAR 

RECEPTOR 

XR78E/F) 



2.5 



50ll U95102 



IXenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
[complete cds 



9.00E-10 MYSH_BOVIN 



MYOSIN I HEAVY 
CHAIN-LIKE 
PROTEIN (MIHC) 
(BRUSH BORDER 
MYOSIN I) (BBMI) 



"5021 U95094 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



2.00E-04 BAL_HUMAN 



BILE-SALT- 
ACTIVATED 
LIPASE 

PRECURSOR (EC 

1.1.3) (EC 
3.1.1.13) (BAL) 
(BILE-SALT- 
STIMULATED 
LIPASE) (BSSL) 
ESTERASE) 
(PANCREATIC 
LYSOPHOSPHOLIP 
ASE) 



503 AF080399 



Drosophila 
melanogaster 
mitotic 
checkpoint 
control protein 
kinase BUB1 
(Bubl) mRNA, 
Icomplete cds 



1.1 



NAT1 YEAST 



N-TERMINAL 
ACETYLTRANSFE 
RASE 1 (EC 
2.3.1.88) 



5041 U59706 



iGallus gallus 
alternatively 
spliced AMPA 
glutamate 
receptor, isoform 
GluR2 flop, 
(GluR2) mRNA, 
[partial cds. 



0.014 



<NONE> 



<NONE> 



<NONE> 



"5051 U95094 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



2.00E-05 



<NONE> 



<NONE> 



5061 U95098 



IXenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



2.00E-04 



<NONE> 



<NONE> 
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Table 2 




Z35094 
U95I02 



515 I D38417 



516 I L10911 



Homo sapiens 1CI 3.00E-10 
YAC 9IA12, right 
end sequence 

H.sapiens CpG 
DNA, clone 90h2, 
forward read 
cpg90h2.ftla 



H.sapiens mRNA 
for SURF-2 

Aenopus iaevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



5.00E-97 
7.00E-06 



Mouse mRNA for 
arylhydrocarbon 
receptor, 
complete cds 



e-154 



Homo sapiens 
splicing factor 
(CC 1.4) mRNA, 
complete cds. 



e-117 



NUCLEAR" 
FACTOR 1 CLONE 
PNF1/RED1 (NF-I) 
(CCAAT-BOX 
BINDING 
TRANSCRIPTION 
FACTOR) (CTF) 
(TGGCA-BINDING 

(PROTEIN) 

SUK2_HUMAN SURFEIT LOCUS" 
'PROTEIN 2 



2.4 



<NONE> 



<NONE> 



TEGU_EBV 



LARGE 

TEGUMENT 

PROTEIN 



3.4 



<NONE> 



<NONE> 



<NONE> 
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Table 2 



Nearest Neighbor (BlastN vs. Genbank)" 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



SEQ | ACCESSION 
ID 



~5VJ\ XT7093 



Human HLA-F 
gene for human 
leukocyte antigen 



DESCRIPTION I P VALUE 



ACCESSION 



0.009 



YEN1 SCHPO 



O 13695 

schizosaccharomyces 
pombe (fission 
yeast), hypothetical 
52.9 kd serine-rich 
protein cllg7. 01 in 
chromosome i. 1 1/98 



DESCRIPTION 



P VALUE 



5.4 



518 I AB017026 



Mus musculus 
mRNA for 
oxysterol-binding 
protein, complete 
cds 



OXYB HUMAN 



OXYSTEROL- 
BINDING PROTEIN 



Tl9| X55038 



Mouse mCENP-B | 
gene for 
centromere 
autoantigen B 



0.001 



YNW7 YEAST 



HYPOTHETICAL 
68.8 KD PROTEIN 
IN URE2-SSU72 
1NTERGEN1C 
REGION 



520 I ABO 18323 



Homo sapiens 
mRNA for 
KIAA0780 
protein, partial 
cds 



3.00E-41 LBR_CHICK 



LAMIN B 
RECEPTOR 



5211 U95094 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



1.00E-10 CA25_HUMAN 



PROCOLLAGEN 
ALPHA 2(V) 
CHAIN 
PRECURSOR 



122 1 X03558 



Human mRNA 
for elongation 
factor 1 alpha 
subunit 



EF11 HUMAN 



ELONGATION 
FACTOR 1 -ALPHA 
1 (EF-I-ALPHA-1) 



e-110 



523 1 U95102 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



3.00E-11 YMT8_YEAST 



HYPOTHETICAL 
36.4 KD PROTEIN 
IN NUP116-FAR3 
INTERGENIC 
REGION 



524 I AB014591 



Homo sapiens 
mRNA for 
KIAA0691 
protein, complete 
cds 



NOT2 YEAST 



GENERAL 
NEGATIVE 
REGULATOR OF 
TRANSCRIPTION 
SUBUNIT 2 



525 I AB019488 



Homo sapiens 
DN A for TRKA, 
exon 17 and 
complete cds 



TRKA HUMAN 



HIGH AFFINITY 

NERVE GROWTH 

FACTOR 

RECEPTOR 

PRECURSOR 

PROTEIN) (P140- 

TRKA) 
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527 



Xenopus laevis 
mitotic 
phosphoprotein 
90 mRNA, 
complete cds 



528 I J03750 



mRNA, complete 
cds 



Mouse single 
stranded DNA 
binding protein p9 
mRNA, complete 
cds. 



529 I 095094" 



532 



Xenopus laevis 
IXL-INCENP 
|(XL-INCENP) 
mRNA, complete 
[cds 

'H.sapiens CpG 
[DNA, clone 
187a 10, reverse 
read 

Jcpgl87al0.rtla. 
jDrosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
Icomplete cds 



e-135 



P15_HUMAN 



U95094 Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



533 | U50535 



1.00E-12 I RS5_DROME 



8.00E-6I HN3BJVIOUSE 



3.00E-60 



240K PROTEIN OF 
ROD 
PHOTORECEPTOR 
CNG-CHANNEL 
CYCLIC- 
NUCLEOTIDE- 
GATED CATION 
CHANNEL 4 (CNG 
CHANNEL 4) 
MODULATORY 
SUBUNIT)) 
ZERKNUELLT 
PROTEIN 1 rZFN-n 



0.018 



ACTIVATED RNA 3.00E-21 

polymerase ii 
transcriptiona| 
l coactivator 

P15(PC4)(P14) 



40S RIBOSOMAL | 6^2~ 
PROTEIN S5 



HEPATOCYTE | 4.00E-15 
NUCLEAR 
FACTOR 3-BETA 
(HNF-3B) 

<NONE> 



4.00E-11 



<NONE> 



(Human BRCA2 
region, mRNA 
sequence CG006 



534 | X9284I 



H.sapiens MICA 
gene 



4.00E-12 ALU1_HUMAN 



I.OOE-55 LIN 1_HUMAN 



<NONE> 



<NONE> 



!!!! ALU 

SUBFAMILY J 

WARNING ENTRY 
tin 



1.1 



LINE- 1 REVERSE | 6.00E-09 
TRANSCRIPTASE 
HOMOLOG 
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Table 2 



I SEQ I ACCESSION 
ID 



DESCRIPTION 



"535*1 U60337 



(Homo sapiens 
beta-mannosidase 
mRNA, complete 
cds 



"5361 M2I731 



Human lipocortin 
V mRNA, 
complete cds. 



DESCRIPTION 



0 



NODC_BRAEL IN 



lACETYLGLUCOSA 
MINYLTRANSFER 
ASE (EC 2.4.1.-) 



e-169 



ANX5 HUMAN ANNbXlN v 

(LIPOCORTIN V) 



"5371 Y08013 



538 | <NO~~I~> " 
~539\ M98502" 



S.salar DNA 
segment 
containing GT 

|repeat 

<NONE> 
iMus musculus 
protein encoding 
twelve zinc finger 
proteins (pMLZ- 
4) mRNA, 
I complete cds. 



0.006 



<NONE> 
2.00E-17 



<NONE> 
DYNACH1CK 



(ENDONEXIN II) 
(CALPHOBINDIN 
I) (CBP-I) 
(PLACENTAL 
ANTICOAGULANT 
PROTEIN I) (PAP-I) 
(ANTICOAGULANT 

ALPHA) (VAC- 
jALPHA) 
ifANC HORIN CII) 
<NONE> | <NONE> 



<NONE> 
IdYNACTIN, 117 
KD 1SOFORM 



1.4 



1.00E-05 



"5401 U95102 



"54TT U95094 



542 AF083322 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
| co mplete cds 
Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



Homo sapiens 
centriole 
associated protein 
CEPHOmRNA, 
complete cds 
Human 
glutathione S- 
transferase 
mRNA, complete 
cds. 



6.00E-05 HXA3_HAE1N 



1.00E-13 AMOJCLEAfc, 



"TT33 CA34_HUMAN 



"iM70 GTMLHUMAN 



HEME:HEMOPEXl 
N-BINDING 
PROTEIN 
PRECURSOR 

AMINE OXIDASt 
PRECURSOR (EC 
1.4.3.6) 
(MONAMINE 
OXIDASE) 
(TYRAMINE 
[OXIDASE) 



2.6 



1.5 



PROCOLLAUfcN I ~~5~ 
ALPHA 3(IV) 
CHAIN 
PRECURSOR 

GLUTATHIONES- I 5.0UfcOy 
TRANSFERASE, 
MICROSOMAL (EC| 
2.5.1.18) 
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Table 2 



SEQ 
ID 



544 



545 



546 
547 



548 



ACCESSION 



U67522 



U95102 



<NONE> 
<NONE> 
D87001 



INSCRIPTION I P VALUE 



Methanococcus 
jannaschii section 
64 of 150 of the 
complete genome 

Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



549 I U95094 



<NONE> 
<NONE> 
liiuman (lambda) 
1DNA for 
immunoglobulin 
[light chain 



0.37 



2.00E-07 



<NONE> 
<NONE> 
035 



Nearest Neighbor (BlastX vs. Non-Red undant Protei^ 
ACCESSION r DESCRIPTION I P VALUE 



A1AA HUMAN 



<NONE> 



<NONE> 
<NONE> 
VAL3 TYLCU 



ALPHA- 1 A 

ADRENERGIC 

RECEPTOR 

<NONE> 



<NONE> 
<NONE> 
AL3 PROTEIN (C3 



550 j D16991 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



551 



552 



M34025 
M98502 



553 I U95098 



Human HepG2 
partial cDNA, 
clone 

Ihmd2d01m5 
|Human fetal Ig 
heavy chain 
variable region 
Mus musculus 
protein encoding 
Itwelve zinc finger 
proteins (pMLZ- 
|4) mRNA, 
[complete cds, 



3.UOE-08 TEGU HSV11 



8.UOE-09 PTM1 YEAST 



Xenopus laevis 
'mitotic 

phosphoprotein 
44 mRNA, partial 

(cds 

H.sapiens flow- 
sorted 
chromosome 6 
Hindlll fragment, 
[SC6pA15C3 



3.2 



5.00E-14 



<NONE> 



<NONE> 



0.002 



<NONE> 



4.3 



<NONE> 



<NONE> 
<NONE> 
3.2 



LARGE 
TEGUMENT 
PROTEIN (VIRION 
PROTEIN UL36) 



0.004 



PROTEIN PTM1 
PRECURSOR 



<NONE> 



<NONE> 



0.033 



<NONE> 



<NONE> 



5 55 I U74496 iHuman 

chromosome 4q35 
subtelomeric 
[sequence 



J.OOE-20 ALU INHUMAN 



8.00E-08 



ICP4 VZVD 



<NONE> 



!!!! ALU 

SUBFAMILY J 

WARNING ENTRY 
im 



<NONE> 



5.00E-06 



trans-acting 
transcriptionaI 
l protein icp4 



0.39 
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Table 2 



^ < Ge " bank > l Nearest N ei * hbor T ^ l^i^ ^^ 



SEQ | ACCESSION 
ID 



5561 U39875 



DESCRIPTION 



Rattus norvegicus 
EF-hand Ca2+- 
binding protein 
p22 mRNA, 
complete cds. 



P VALUE ACCESSION 



2.00E-56 I YHFK_ECOLI 



HYPOTHETICAL 
79.5 KD PROTEIN 
IN CRP-ARGD 
INTERGENIC 
REGION (Q696) 



P VALUE 



9.8 



557 1 U65416 



Human MHC 
class I molecule 
(MICB) gene, 
complete cds 



0.12 



<NONE> 



558 I AG000037 



Homo sapiens 
genomic DNA, 
21 q region, clone 
9H11A22 



5.00E-25 



<NONE> 



559| U95102 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



5.00E-05 



<NONE> 



560 I AB007918 



Homo sapiens 
mRNA for 
KIAA0449 
protein, partial 
cds 



0.015 



VGLE HSV11 



GLYCOPROTEIN E 
PRECURSOR 



5611 U58884 



Mus musculus 
SH3-containing 
protein SH3P7 
mRNA, complete 
cds. similar to 
Human Drebrin 



1.00E-73 YCV2_YEAST 



HYPOTHETICAL 
13.8 KD PROTEIN 
IN PWP2-SUP61 
INTERGENIC 
REGION 



562 I AB007878 



Homo sapiens 
KIAA0418 
mRNA, complete 
cds 



e-110 



GLU2 MAIZE 



GLUTELIN 2 
PRECURSOR 
(ZEIN-GAMMA) 
(27 KD ZEIN) 



563 I AF065482 



Homo sapiens 
sorting nexin 2 
(SNX2) mRNA, 
complete cds 



YJD6 YEAST 



5641 U27873 



Stealth virus 1 
clone 3B11T7 



0.002 



SYN1 HUMAN 



HYPOTHETICAL 
49.0 KD PROTEIN 
INNSP1-KAR2 
INTERGENIC 

REGION 

SYNAPSINS 1A 



1.6 



AND IB (BRAIN 
PROTEIN 4.1) 



565 



L38951 



Homo sapiens 
importin beta 
subunit mRNA, 
complete cds 



2.00E-68 



VP2 BRD 



STRUCTURAL 
CORE PROTEIN 
VP2 



566 I AF007155 



Homo sapiens 
clone 23763 
unknown mRNA, 
partial cds 



e-165 



YOHI AZOVI 



HYPOTHETICAL 
33.2 KD PROTEIN 
IN IBPB 5'REGION 



7.5 
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Table 2 



SEQ 
ID 



567 



Nearest Neighbor (BlastN vs. GiHbj^kT 



ACCESSION 



Z56295 



283792 



U11820 



DESCRIPTION 



H.sapiens CpG 
DNA, clone I0c2 
forward read 
cpg!0c2.ftla. 
G.gaJlus 
microsatellite 
DNA (LEI0222 




eline 

immunodeficienc 
y virus 

USIL2489J7B 
gag polyprotein 
(gag) gene, 



0.12 



Al ABCANFA 




ALPHA- IB 
ADRENERGIC 
RECEPTOR 
(FRAGMENT) 



0.85 



<NONE> 
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Table 2 



r — - ^ ' B ^;: t r, nrrr 



filO I ACCESSION | DESCR1FHON 
ID 



| P VALUE I ACCESSION 



ImRNA sequence 



576 I Y11306 iHomo sapiens I 2.00E-48 
mRNA for hTCF- 1 

4 

■577^ X65279 pWEI5 cosmid | 7.00E-69 
vector DNA 



"5781 Ml 0296 



iMouse DNA with I 0.001 
homology to EBV J 
IR3 repeat, 
segment 1, clone 

Mu2. 

Canine mRNA tor e-162 
68kDA subunit of | 
signal recognition 
particle (SRP68) 



TCF1_HUMAN 
OCLN_POriK 
LMB1.HYDA1 

SR68 CAN FA 



T-CELL-SPECIFIC 
TRANSCRIPTION 
FACTOR 1 (TCF-1) 

Q28793 potorous 
tridactylus (potoroo). 
occludin. 11/98 

LAMININ BETA-1 
CHAIN 
PRECURSOR 
(FRAGMENTS) 

SIGNAL 
RECOGNITION 
PARTICLE 68 KD 
PROTEIN (SRP68) 



2.00E-15 



0.71 



1.3 



580 | AF086438 Homo sapiens full 2.00E-04 
length insert 
IcDNA clone 
I ZD80GI1 
381 I U15140 Mycobacterium 
|bovis ribosomal 
iproteins IF-1 
complete cds, and ] 
]S4 (rpsD) gene, 

pa rtial cds 

582~1 D 13292 Human mRNA 

for ryudocan core 
jprotein 



<NONE> 



<NONE> 



e-166 



RSP4 ARATH 



"5831 S71022 



"584| L20934 



"5851 Z49269 



neoplasm-related 
CI 40 product 
[human, thyroid 
carcinoma cells, 
mRNA, 670 nt] 



Anopheles 
gambiae complete I 
mitochondrial 
Igenome 
H.sapiens gene 
for chemokine 
HCC-1. 



9.00E-30 RL6_HUMAN 



0.014 



1.1 



<NONE> 



AMY1 DICTH 



<NONE> 



<NONE> 



1.9 



5.00E-16 



<NONE> 



<NONE> 



40S RIBOSOMAL I 1.4 
PROTEIN SA (P40) 
(LAMININ 
RECEPTOR 

HOMOLOG) 

60S RIBOSOMAL I 56 
PROTEIN L6 (TAX- 
RESPONSIVE 
ENHANCER 
ELEMENT 
BINDING PROTEIN) 
10 7) (TAXREB107) 
<NONE> 



ALPHA-AMYLASE I 2X 
1 (EC 3.2.1.1) (1,4- 
ALPHA-D- 
GLUCAN 

GLUCANOHYDRO | 
LASE) 
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587 



U95098 



AF029893 




589 I U95098 



Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 

Homo sapiens i- 
beta-l,3-N- 
acetylgiucosamin 
yitransferase 
mRNA, complete 
cds 

T.thermophila 
calcium-binding 
1 25 kDa (7CBP 
25) protein gene, 
complete cds 



2.00E-04 



0.13 



<NONE> 



HEMO_PIG 



<NONE> 



HEMOPEXIN 
PRECURSOR 
(HYALURONIDAS 
E) (EC 3.2. 1.35) 



<NONE> 




590 



xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 

. [cds 

Ah060246 Mus musculus 
strain C57BL/6 
zinc finger protein 
106(Zfpl06) 
mRNA, H3a-a 
allele, complete 

____ cds 

| Y11966 B.aphidicola(host| 
T.suberi) plasmid 
IpBTsl genes 
leuA, hspA, 
repA2, repAl, 
leuB, leuC, leuD, 

[leuA 

Human SNC19 
JmRNA sequence 

Ai-043084 Lycopersicon 
esculentum 
ethylene receptor 
homolog (ETR1) 
mRNA, complete 



6.00E-04 <NONE> 



l.UOE-83 SCRB__PEDPE 



<NONE> <NONE> 



[SUCROSE^" 
PHOSPHATE 

HYDROLASE (EC 

3.2.1.26) 

(SUCRASE) 



10 



0.37 



592] U20428 



1.00E-64 



0.37 



r r -£ijA YCTU H YPOTHETICAL " 
30.8 KD PROTEIN 

|CY49.22 

KNlR_DROMb ZYGOTIC GAP 

PROTEIN KNIRPS 



0.29 



9.9 



W X65279 
W I U95098 


pwbl5cosmid i 
|vector DNA | 


5.00E-66 


<JUA1_SV40 


COAT PROTEIN 
VP1 


0.001 




Xenopus laevis 
mitotic 

phosphoprotein 
(44 mRNA, partial 1 


0.041 


UL88_HSV7J 


PROTEIN U59 


5.8 
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Table 2 



Moarpct ^.HhorJBiastN vs. Genbank) [Nearest Neighbor (B .as t* vs. .on-.ed ungant^ 



| SEQ | ACCESSION 
ID 



cds 



DESCRIPTION I P VALUE ACCESSION 



"596 1 M91452 



Sus scrofa 
ryanodine 
receptor (RYR1) 
gene, complete 
cds. 



3.2 



<NONE> 



5971 U77327 



Human Ki-1/57 
intracellular 
antigen mRNA, 
partial cds 



e-158 



GAT1 CHICK 



ERYTHROID 
TRANSCRIPTION 
FACTOR (GATA-1) 
(ERYF1) 



5981 U77327 



Human Ki-1/57 
intracellular 
antigen mRNA, 
partial cds 



RPB7 ARATH 



DNA-DIRECTED 
RNA 

POLYMERASE II 
19 KD 

POLYPEPTIDE (EC 
2.7.7.6) (RNA 
POLYMERASE II 
SUBUNIT 5) 



"599 1 Y16964 



Saccharomyces 
sp. mitochondrial 
DNA for OLI1 
gene, strain CID1 



0.37 



NMD5 YEAST 



NONSENSE- 
MEDIATED MRNA 
DECAY PROTEIN 5 



6001 095 102 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



6.00E-06 



<NONE> 



<NONE> 



6011 U95098 



Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



602 I AF091046 



Brugia pahangi 
nuclear hormone 
receptor (bhr-1) 
gene, partial cds 



8.00E-08 



<NONE> 



1.1 



INVO PONPY 



INVOLUCRIN 



0.23 



6031 M87339 



Human 
replication factor 
C, 37-kDasubunit| 
mRNA, complete 
cds 



AC12 HUMAN 



ACTIVATOR 1 37 
KD SUBUNIT 
(REPLICATION 
FACTOR C 37 KD 
SUBUNIT) (Al 37 
KD SUBUNIT) (RF- 
C 37 KD SUBUNIT) 
(RFC37) 



"604| D28116 



Human genes for 
collagen type IV 
alpha 5 and 6, 
exon 1 and exon 



0.39 



<NONE> 
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Table 2 



SEQ 
ID 



Nearest Neighbor (BJastN vs. Genbank) 



ACCESSION 



607 



U95102 



AE001149 



DESCRIPTION 



X14168 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 
Borrelia 
burgdorferi 
(section 35 of 70) 
of the complete 
Igenome 



P VALUE 



2.00E-06 



0.13 



Nearest Neighbor (BlastX vs. Non-Redundant Protei^T 



ACCESSION 



<NONE> 



<NONE> 



DESCRIPTION 



<NONE> 



<NONE> 



Human pLC46 
with DNA 
replication origin 



H.sapiens CpG 
DNA, clone 
187a 10, reverse 
read 

|cpgl87al0.rtla. 



Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



U95094 



U95102 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 

|cds 

Homo sapiens 
mRNA, 
chromosome 1 
specific transcript 
JKIAA0487 



6.00E-16 



Z 1 36_HUMAN ZINC FINGER 
PROTEIN 136 



7.00E-90 



HN3B_RAT fHEPATOCYTE 
NUCLEAR 
FACTOR 3-BETA 
(HNF-3B) 



0.043 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
[complete cds 



7.00E-07 



e-106 



*-<JCV_MOUSb VERSICANCORE 
PROTEIN 
PRECURSOR 
(LARGE 
FIBROBLAST 
PROTEOGLYCAN) 
(CHONDROITIN 
SULFATE 
PROTEOGLYCAN 
CORE PROTEIN 2) 

|(PG-M) 

CA 1 1_CHICK PROCOLLAGEN 

ALPHA 1(1) CHAIN I 
PRECURSOR 



KKFB^CVMAi) RNA-DIRECTED" 
RNA 

POLYMERASE (EC j 
2.7.7.48) (ORF1B) 



0.005 



<NONE> 



P VALUE 



<NONE> 



<NONE> 



0.31 



1.00E-19 



3.5 



0.4 



9.7 



<NONE> 



<NONE> 
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Table 2 



SEQ 
ID 



Nearest N^.B^N V, Uen^> ^ ^ ^ ~»-~ ^ 



ACCESSION! DESCRIPTION IPVALUfc ACCESSION 



6TT1 U95094 



6U\ U95760 



I Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 

Drosophila 
melanogaster 
strawberry notch 
(sno) mRNA, 
complete cds 



6.00E-05 



UL52 EBV 



"6151 U95102 



"6161 J05499 



6171 Ml 9262 



6l8| M21191 



6191 U95094 



3.00E-71 POLGPVYHU 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
(c omplete cds 
Rattus norvegicus 
L-glutamine 
amidohydrolase 
mRNA, complete 
|cds 

Rat clathrin light 

chain (LCB3) 

mRNA, complete 
|cds 

Human aldolase 

pseudogene 

mRNA, complete 

cds 

Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



9.00E-09 VP3_ROTPC 



e-143 



0.37 



GLSL_RAT 



Y642 METJA 



HELICASE/PRIMA 
SE COMPLEX 
PROTEIN 
(PROBABLE DNA 
REPLICATION 
PROTEIN BSLF1) 
GENOME 
POLYPROTEIN 
(CONTAINS: N- 
TERMINAL 
PROTEIN; HELPER] 
COMPONENT 
PROTEINASE (EC 
3.4.22.-) (HC-PRO); 
42-50 KD 
PROTEIN; 
CYTOPLASMIC 
INCLUSION 
PROTEIN (CI); 6 
KD PROTEIN; 
NUCLEAR 
INCLUSION 
PROTEIN A (NI- A) | 
(EC 3.4.22.-) (49K 
PROTEINASE) (49 
INNER CORE 
PROTEIN VP3 



GLUTAMINASE, 
LIVER ISOFORM 
PRECURSOR (EC 
3.5.1.2) (GLS) 

HYPOTHETICAL 
PROTEIN MJ0642 



l.OOE-32 LINl_NYCCO 



1.00E-11 NUCM_BOVlN 



5.9 



4.3 



7.7 



7.00E-67 



5.8 



LINE-1 REVERSE 
TRANSCRIPTASE 
HOMOLOG 

NADH- 
UBIQUINONE 
OXIDOREDUCTAS | 
E 49 KD SUBUNIT 
(EC 1.6.5.3) (EC 
1.6.99.3) 
(COMPLEX 1- 
49KD) (CI-49KD) 



0.044 
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Nearest Neighbor (BlastN vs. Genbankf 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



SEQ 
ID 



620 



U95098 



AF041428 



DESCRIPTION 



Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



Homo sapiens 
ribosomal protein 
s4 X isoform 
gene, complete 
cds 



P VALUE 



ACCESSION 



0.005 



0.002 



HEMZ RHOCA 



<NONE> 



DESCRIPTION 



P VALUE 



FERROCHELATAS 
E(EC 4.99.1.1) 
(PROTOHEME 
FERRO-LYASE) 



<NONE> 



4.4 



<NONE> 



623 095094" 



W4 AF 100470 



Chironomus 
thummi DNA for 
Cla repetitive 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
[cds 

IRattus norvegicus 
ribosome attached 
[membrane protein 
4 (RAMP4) 
mRNA, complete 
cds 



625 I U85193 



Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
[complete cds 



0.13 



8.00E-04 



1.00E-53 



<NONE> 



<NONE> 



<NONE> 



2.00E-38 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



Human lamin A 
mRNA, 3'end. 



6.00E-16 



<NONE> 



<NONE> <NONE> 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



0.014 



ACDV RAT 



ACYL-COA 
DEHYDROGENAS 

, VERY-LONG- 
CHAIN SPECIFIC 
PRECURSOR (EC 
1.3.99.-) (VLCAD) 



4.00E-20 



629 
"630" 



<NONE> 
U95102 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
(cds 

<NONE> 

Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
[complete cds 



3.00E-10 



<NONE> 
2.00E-05 



<NONE> 



<NONE> 
<NONE> 



<NONE> 



<NONE> 
<NONE> 



<NONE> 



<NONE> 
<NONE> 
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Table 2 



Neither (BlastN vs. Genbank) [Nearest Neighbor (BlastX vs. N^Rc^ 



"SEQ 
ID 

631 



ACCESSION 



U95102 



DESCRIPTION 

Xenopus laevis 
mitotic 

phosphoprotein 
OmRNA, 
complete cds 



VALUE I ACCESSION 



6.00E-05 



<NONE> 



<NONE> 



<NONE> 



632 



U95094 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



6.00E-05 YS83_CAEEL 



HYPOTHETICAL 
86.9 KD PROTEIN 
ZK945.3 IN 
CHROMOSOME II 



0.65 



633 



U95102 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



3.00E-09 NRP_MOUSE 



NEUROPILIN 
PRECURSOR (A5 
PROTEIN) 



2.7 



634 



U95098 



Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



2.00E-05 



Y4JN RHISN 



HYPOTHETICAL 
16.3 KD PROTEIN 
Y4JN 



635 



U95102 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



6.00E-05 



<NONE> 



636 



X64707 



H.sapiens BBC1 
mRNA 



^-T79 I RL1 3_HUMAN 60S RIBOSOMAL 

PROTEIN L13 
(BREAST BASIC 
CONSERVED 
PROTEIN 1) 



637 



U95I02 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



3.00E-08 



<NONE> 



638 



X14168 



Human pLC46 
with DNA 
replication origin 



5.00E-14 SP3JWMAN 



TRANSCRIPTION 
FACTOR SP3 (SPR- 
2) (FRAGMENT) 



0.19 



639 



X90999 



H.sapiens mRNA 
for Glyoxalase II 



9.00E-20 GL02_HUMAN 



640 



AF083322 



Homo sapiens 
centriole 

associated protein 
CEPHOmRNA, 
complete cds 



HYDROXY ACYLG 
LUTATHIONE 
HYDROLASE (EC 
3.1.2.6) 



0.007 



9.00E-51 KIF4_MOUSE 



KINESIN-LIKE 
PROTEIN KIF4 



0.005 
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Table 2 



Nearest Neighbor (BlastN vs. Genbank) 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



SEQ 
ID 



ACCESSION 



DESCRIPTION 



P VALUE 



ACCESSION 



DESCRIPTION 



P VALUE 



641 



Z12002 



M.musculus Pvt-1 
mRNA. 



0.36 



CP5F CANTR 



CYTOCHROME 
P450 LIIA6 
(ALKANE- 
INDUCIBLE)(EC 
1.14.14.1) (P450- 
ALK3) 



5.6 



Ml 0206 



643 



K02668 



R.sphaeroides 
reaction center L 
subunit (complete 
cds) and M 
subunit (5* end) 
genes. 



1.1 



YGR1 YEAST 



HYPOTHETICAL 
34.8 KD PROTEIN 
IN SUT1-RCK1 
INTERGENIC 
REGION 



0.006 



E. coli ddl gene 

— 1 1 t~> 

alaninerD-alanine 
ligase and ftsQ 
and ftsA genes, 
complete cds, anc 
ftsZ gene, 5' end. 



3.3 



ANKB_HUMAN 



ANKYRIN, BRAIN 
VARIANT I 
(ANKYRIN B) 
(ANKYRIN, 
NONERYTHROID) 



7.00E-07 



<NONE> 



645 



X53616 



.domesticus 
calnexin (pp90) 
mRNA 



<NONE > 
1.1 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



<NONE> 



X57010 



Human COL2A1 
gene for collagen 
II alpha 1 chain, 
exonsE2-E15 



3.3 



PRIO PIG 



MAJOR PRION 
PROTEIN 

PRECURSOR (PRP) 



1.9 



647 



U95097 



Xenopus laevis 
mitotic 

phosphoprotein 
43 mRNA, partial 
cds 



1.1 



UL07 HSV2H 



PROTEIN UL7 



7.3 



649 



M93425 



650 



L47615 



Human CAMII- 
psi3 calmodulin 
retropseudogene 



0.37 



PRTP EBV 



Human protein 

tyrosine 

phosphatase 

(PTP-PEST) 

mRNA, complete 

cds. 



PROBABLE 
PROCESSING AND 
TRANSPORT 
PROTEIN 



7.5 



PTNC HUMAN 



PROTEIN- 
TYROSINE 
PHOSPHATASE Gl 
(EC 3.1.3.48) 
(PTPG1) 



e-107 



Mus musculus 
DNA-binding 
protein (FIi-1) 
gene, 5' end of 
cds. 



0.13 



YA53 SCHPO 



HYPOTHETICAL 
24.2 KD PROTEIN 
CI3A11.03IN 
CHROMOSOME I 



2.00E-07 



Homo sapiens 
beta-mannosidase 
mRNA, complete 



GIL1 ENTHI 



GALACTOSE- 
INHIBITABLE 
LECTIN 170 KD 



0.22 
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Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redund* 


int Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 






cds 






SUBUNIT 




652 


U08813 


Oryctolagus 

cuniculus 

Na+/glucose 

cotransporter- 

related protein 

mRNA, complete 

cds. 


1.00E-22 


NAG 1_HUMAN 


SODIUM/GLUCOS 
E 

COTRANSPORTER 
1 (NA(+)/GLUCOSE 
COTRANSPORTER 
1)(HIGH 
AFFINITY 
SODIUM- 
GLUCOSE 
COTRANSPORTER 

) 


0.1 


653 


Y00282 


Human mRNA 
for ribophorin II 


2.00E-78 


RIB2_HUMAN 


DOLICHYL- 
DIPHOSPHOOLIG 
OSACCHARIDE— 
PROTEIN 

GLYCOSYLTRANS 
FERASE 63 KD 
SUBUNIT 
PRECURSOR (EC 
2.4.1.119) 
(RIBOPHORIN II) 


5.00E-19 


654 


D10051 


Human gene for 
92-kDatype IV 
collagenase, 5- 
flanking region 


0.014 


TAGB_DICD1 


PRESTALK- 
SPECIFIC 
PROTEIN TAGB 
PRECURSOR (EC 
3.4.21.-) 


7.6 


655 


M29930 


Human insulin 
receptor (allele 2) 
gene, exons 14, 
15, 16 and 17. 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


656 


U78310 


Homo sapiens 
pescadillo 
mRNA, complete 
cds 


0 


YG2S_YEAST 


HYPOTHETICAL 
69.9 KD PROTEIN 
IN MIC1-SRB5 
INTERGENIC 
REGION 


0.002 


657 


X68792 


S.coelicolor 
A3(2) promoter 
sequence pth270 




I DOu I LZ>r\0 1 


HYPOTHFTICAL 
27.0 KD PROTEIN 
IN VAL1-HSP26 
INTERGENIC 
REGION 


0.073 


658 


U50535 


Human BRCA2 
region, mRNA 
sequence CG006 


4.00E-12 


ALU1_HUMAN 


!!!! ALU 
SUBFAMILY J 
WARNING ENTRY 

MM 


1.2 



267 



BNSDOCID: <WO 9933982A2_L> 



WO 99/33982 




PCT/US98/27610 



Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 




HP P T PT ! fYW 


P V A T I IP 






P VALUE 


659 


U 15522 


Sus scrofa clone 
pvglalg heavy 
chain variable 
VDJ region 
mRNA, partial 
cds. 


3.2 


Z165_HUMAN 


ZINC FINGER 
PROTEIN 165 


3.2 


660 


M20918 


C.thummi piger 
haemoglobin (Hb) 
gene DNA, 
complete cds. 


0.12 


YT25_CAEEL 


HYPOTHETICAL 
59.9 KD PROTEIN 
B0304.5 IN 
CHROMOSOME II 


0.033 


661 


U60337 


Homo sapiens 
beta-mannosidase 
mRNA, complete 


0 


<NONE> 


<NONE> 


<NONE> 


662 


U95098 


Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 


0.001 


ENV_MLVFP 


ENV 

POLYPROTEIN 
PRECURSOR 
(CONTAINS: 
KNOB PROTEIN 
GP70; SPIKE 
PROTEIN PI 5E;R 
PROTEIN) 


3.3 


663 


M97287 


Human 

MAR/SAR DNA 
binding protein 
(SATBl)mRNA, 
complete cds. > :: 
gb|I58691 (158691 
Sequence 1 from 
patent US 
5652340 


0 


SAT1_HUMAN 


DNA-BINDING 
PROTEIN SATB1 
(SPECIAL AT-RICH 
SEQUENCE 
BINDING PROTEIN 
1) 


2.00E-20 


664 


L42612 


Homo sapiens 
keratin 6 isoform 
K6f (KRT6F) 
mRNA, complete 
cds 


e-168 


K2C4_BOVIN 


KERATIN, TYPE II 
CYTOSKELETAL 
59 KD, 

COMPONENT IV 


4.00E-10 


665 


U17901 


Rattus norvegicus 
phospholipase A- 
2-activating 
protein (plap) 
mRNA, complete 
cds. 


e-152 


PLAP_MOUSE 


PHOSPHOLIPASE 
A-2-ACTIVATING 
PROTEIN (PLAP) 


4.00E-13 


666 


M73047 


Homo sapiens 
tripeptidyl 
peptidase II 
mRNA, complete 
cds. 


0 


MERT_STRLI 

] 


MERCURIC 

TRANSPORT 

PROTEIN 

[MERCURY ION 

TRANSPORT 

PROTEIN) 


4.4 
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Table 2 



1 ; ■ = : 1 N earest N eig 
SEQ ACCESSION 
ID I 


hbor (BlastN vs. Genbank) [Nearest Neighbor (Bla 
DESCRIPTION | P VALUE | ACCESSION 


istX vs. Non-Keaunaai 
DESCRIPTION 


P VALUE 


667 1 U09954 t 
F 

1 c 


iumanribosomall 0 1 RL9_HUMAN C 
>rotein L9 gene, 1 
>' region and 1 
,omplete cds. | J 


>0S RIBOSOMAL 
>ROTEIN L9 


2.00E-11 


668 X98330 I 
i 

1 1 


-I.sapiens rnRNA 1-1 1 HS74_MOUbb t 
or ryanodine 1 \ 
eceptor 2 1 . i 


-IP AT QHOfK 70 

<D PROTEIN AGP- 

1 


0.034 


669 U95094 , 
< 

1 1 
1 { 


Kenopuslaevis 0.002 RPC2_DROMb J 
XL-INCENP 

;XL-INCENP) 1 

rnRNA, complete 

n ds 1 


3NA-DIRECTED 
flNA 

POLYMERASE III 
128 KD 

rOL Y rfcr 1 1UL 


1.1 


670 1 AF0692SU 
1 1 


Homo sapiens 7.00E-80 LEGB.PEA 

okadaic acid- 1 

inducible 1 

phosphoprotein 1 

(OA48-18) 

rnRNA, complete 

cds | 1 


LEGUMIN B 
(FRAGMENT) 


0.011 


671 Z71419 

672 AF044965 1 

673 X65319 


S.cerevisiae 1.1 FOCDJ5COU 
chromosome XIV 1 
reading frame I 
ORFYNL143c J 

Homo sapiens e-167 PVR_MOUSb 
polio virus related I 1 
protein 2 gene, I 
alpha isoform, 1 
exon 6 and partial 1 

cds 1 

Cloning vector 1 2.00E-80 1 S106_HUMAN 
pCAT-Enhancer 1 


OUTER 
MEMBRANE 
USHER PROTEIN 
FOCD 

PRECURSOR 

POLIOV1RUS 

RECEPTOR 

HOMOLOG 

PRECURSOR 

CALCYCL1N 
(PROLACTIN 
RECEPTOR 
ASSOCIATED 

rl\U 1 JD11N ) 

CALCIUM- 
BINDING PROTEIN 
A6) 


9.7 
1.00E-12 

"*00E-15 1 

1 

4.3 1 




674 J D29655 


Pig rnRNA for e-103 V319_ASFB7 
UMP-CMP 

kinase, complete I 
cds I 1 


J3 19 PROTEIN 




675 U95094 


Xenopus laevis 8.00E-08 VEUR_KAi 

XL-INCENP 

(XL-INCENP) 

rnRNA, complete 1 
cds 1 


" VASCULAR 
ENDOTHELIAL 
GROWTH FACTOF 
RECEPTOR 1 
PRECURSOR 
RECEPTOR FLT) 
(FLT-1) 


3.3 

1 
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Table 2 



Nearest Neighbor (tf lastN vs. Genbank) [Nearest Nei ghbor (BlastX vs. Non-Redundant Pro tein 



SEQ | ACCESSION 
ID 



676 D90217 



DESCRIPTION 



S. cerevisiae gene 
for YmL33, 
mitochondrial 
ribosoma! 
proteins of large 
subunit 



P VALUE 



ACCESSION 



2.00E-07 



MALY ECOLI 



DESCRIPTION 



MALY PROTEIN 
(EC 2.6.1.-) 



P VALUE 



5.6 



Homo sapiens 
cofactor A protein 
mRNA, complete 
cds 



e-160 



T1CA MOUSE 



TCP1- 

CHAPERONIN 
COFACTOR A 



4.00E-19 



679 1 D50418 



Gorilla gorilla 
DNA sequence 
orthologous to the 
human Xp:Yp 
telomere-junction 
region 



5.00E-14 



YHBZ ECOLI 



Mouse mRNA for 
AREC3, partial 
cds 



2.00E-79 



CYGXJRAT 



HYPOTHETICAL 
43.3 KD GTP- 

RTXirfcTXTr: nnnTrixi 

|IN DACB-RPMA 
INTERGENIC 
(REGION (F390) 



3.3 



OLFACTORY 
GUANYLYL 
CYCLASE GC-D 
PRECURSOR (EC 
[4.6.1.2) 



1.1 



Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



8.00E-08 



P2C2_SCHPO jPROTEIN ~ | 1.00E-04 
PHOSPHATASE 2C 
HOMOLOG 2 (EC 
3.1.3.16) 



681 I ALO 10280 



Plasmodium 

falciparum DNA 
*** 

SEQUENCING 
IN PROGRESS 
*** from contig 
106, complete 
sequence 



0.12 



<NONE> 



<NONE> 



<NONE> 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



5.00E-04 



VSM2 TRYBB 



VARIANT 
SURFACE 
GLYCOPROTEIN 
MITAT 1.2 

PRECURSOR (VSG 
(221) 



4.3 



683] U00238 



Homo sapiens 
glutamine PRPP 
amidotransferase 
(GPAT) mRNA, 
complete cds 



<NONE> 



<NONE> 



<NONE> 



Xenopus laevis 
mitotic 

losphoprotein 
90 mRNA, 
complete cds 



0.005 



PRPR SALTY 



PROPIONATE 

CATABOLISM 

OPERON 

REGULATORY 

PROTEIN 



1.5 
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Table 2 



Nearest Neighbor (BlastN vs. Genbank) 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



| SEQ | ACCESSION 
ID 



DESCRIPTION 



"6851 U95102 



IXenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
1 complete cds 



686 



D25538 



[Human mRNA 
for KIAA0037 
gene, complete 
cds 



P VALUE 



ACCESSION 



7.00E-07 



YAND SCHPO 



HYPOTHETICAL 
30.4 KD PROTEIN 
C3H1.13IN 
CHROMOSOME I 



<NONE> 



0.38 



<NONE> 



<NONE> 



"687 1 U95102 



IXenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
[complete cds 



2.00E-07 



A1AA RAT 



ALPHA- 1 A 
ADRENERGIC 
RECEPTOR (RA42) 



688 



L26956 



Mesocncetus 
auratus stearyl- 
CoA desaturase 
sequence 
including male 
hormone 
dependent gene 
(derived from 
hamster 

Ifrankorgan 



4.00E-33 



<NONE> 



"689 ] U95102 



IXenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
[complete cds 



3.00E-10 



<NONE> 



6901 095102 



IXenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
1 complete cds 



3.00E-09 



Y093 CAEEL 



HYPOTHETICAL | 2.00E-08 
58.5 KD PROTEIN 
T20B12.3 IN 
CHROMOSOME III 



6911 U95102 



IXenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
[complete cds 



8.00E-09 



<NONE> 



692 | AB017026 



IMus musculus 
mRNA for 
oxysterol-binding 
protein, complete 
cds 



OXYB RABIT 



OXYSTEROL- I 1.00E-34 
BINDING PROTEIN 



693 1 U95098 



IXenopus laevis 
mitotic 

phosphoprotein 
1 44 mRNA, partial 
cds 



6.00E-04 



UF02 MAIZE 



FLAVONOL 3-0- 
GLUCOS YLTRANS | 
FERASE (EC 
2.4.1.91) 



3.1 
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Table 2 



Nearest Neighbor (BlastN vs. Genbank) 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



SEQ 
ID 



ACCESSION 



694 



U95102 



DESCRIPTION I P VALUE 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



ACCESSION 



5.00E-04 



<NONE> 



DESCRIPTION 



<NONE> 



P VALUE 



<NONE> 



696 



Caenorhabditis 
elegans 
cyclophilin 
isoform 10 



5.00E-24 



CYPA CAEEL 



AB011167 



6971 U03886 



Homo sapiens 
mRNA for 
KIAA0595 
protein, partial 

cds 

Human GS2 



RFX5 HUMAN 



PEPTIDYL- 
PROLYL CIS- 
TRANS 

ISOMERASE 10 
(EC 5.2.1.8) 



2.00E-29 



mRNA, complete 
cds. 



698 | AF086275 I Homo sapiens fu 
length insert 
cDNA clone 
ZD45C02 



2.00E-28 SKD1 MOUSE 



3.00E-41 I SPT7 YEAST 



BINDING 
REGULATORY 



2.1 



SKD1 PROTEIN 



TRANSCRIPTIONA 
L ACTIVATOR 
SPT7 



4.00E-17 



0.82 



700 U95102 



701 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
(complete cds 



3.00E-10 I CA1E HUMAN 



COLLAGEN 
ALPHA 1(XV) 
CHAIN 
PRECURSOR 



1.1 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
[complete cds 



4.0ut>Il E434 ADECC 



Q65962 canine 
adenovirus type 1 
(strain ell), early e4 
31 kd protein. 11/98 



4.4 



LI 7340 IDrosophila 
melanogaster 
germline 
transcription 
factor gene, 
complete cds. 



3.3 



CISY TETTH 



CITRATE 
SYNTHASE, 
MITOCHONDRIAL 
PRECURSOR (EC 
4.1.3.7) (14 NM 
FILAMENT- 
FORMING 
PROTEIN) 



9.7 



M.musculus 
mRNA fort- 
Complex Tcp- 10a 
gene 



2.00E-45 I PME2 LYCES 



703 I Z96207 



PECTINESTERASE 
2 PRECURSOR (EC 
3.1.1.11) (PECTIN 
METHYLESTERAS 
E) (PE2) 



7.4 



|H.sapiens 
telomeric DNA 
sequence, clone 
12PTEL049,read 
12PTELOO049.se 



8.00E-08 



<NONE> 



<NONE> 



<NONE> 



272 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 



PCT/US98/27610 



Table 2 





Nearest Neighbor (BlastN vs. Genbank) 1 


Nearest Neighbor (BlastX vs. Non-Redunda 


nt Proteins) 


SEQ t 
ID 


\CCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCKJP11UN 


P VALUE 




c 


1 










704 


X58430 1 
1 


Human Hoxl.8 
gene 


e-146 


HXAA HUMAN 

] 


HOMEOBOX 
PROTEIN HOX-A10 
(HOX-lH)(HOX- 
1.8) (PL) 


4.00E-05 


705 


T Tf"\ C t~\C\ A 

U95094 


Aenopus iaevis> 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


f% OOF-06 


YN39 SYNP7 


HYPOTHETICAL 
9.2 KD PROTEIN 
IN CYST-CYSR 
INTERGENIC 
REGION (ORF 81) 


0.89 


706 


U95094 


Xenopus laevis 

(XL-INCENP) 
mRNA, complete 
cds 


1.00E-11 


MYSH_BOVIN 


MYOSIN I HEAVY 
CHAIN-LIKE 
PROTEIN (MIHC) 
(BRUSH BORDER 
MYOSIN I) (BBMI) 


0.001 


707 


M19961 


Human 
cytochrome c 
oxidase subunit 
Vb (coxVb) 
mRNA, complete 
cds. 


e-123 


OTHU5B 


<NONE> 


3.00E-30 


708 


X68380 


M.musculus gene 
for cathepsin D, 
exon 3 


^ nnF-04. 


42 MOUSE 


ERYTHROCYTE 
MEMBRANE 
PROTEIN BAND 
4.2 (P4.2) 
(PALLIDIN) 


9.9 


709 


U95102 


Xenopus iaevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


i noF-i l 


TCPA DROME 


T-COMPLEX 
PROTEIN 1 , 
ALPHA SUBUNIT 
(TCP-1-ALPHA) 


4.3 


710 


\JyD IU x 


mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3 00E-1O 


<NONE> 


<NONE> 


<NONE> 


711 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


4.00E-12 


<NONE> 


<NONE> 


<NONE> 


712 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.002 


<NONE> 


<NONE> 


<NONE> 
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Table 2 





Nearest N 


eighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEC. 
ID 


» ACCESSlOh 


4 DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


713 


ABO 18323 


Homo sapiens 
mRNA for 
KIAA0780 
protein, partial 
cds 


3.00E-41 


LBRCHICK 


LAMIN B 
RECFPTOR 


3.4 


714 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


6.00E-06 


YM8L_YEAST 


HYPOTHETICAL 
71 1 KT) PPOTFrM 

IN DSK2-CAT8 

INTERGENIC 

REGION 


3.00E-08 


715 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


4.00E-13 


PSCJDROME 


POSTERIOR SEX 


0.6 


716 


L28101 


Homo sapiens 
kallistatin (PI4) 
gene, exons 1-4, 
complete cds 


7.00E-07 


IRKXRAT 


INWARD 
Ktt 1 lrlbR 
POTASSIUM 
CHANNEL BIR9 
(KIR5.1) 


5.4 


717 


AC001038 


Homo sapiens 
(subclone 2 h2 
from PI H49) 
DNA sequence 


8.00E-09 


MGMTJYEAST 


METHYLATED- 
DNA —PROTEIN- 

METHYLTRANSFE 
RASE 


0.48 


718 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


1.00E-1 1 




19.9 KD PROTEIN 
IN SACA-UNG 
INTERGENIC 
REGION 
PRECURSOR 


1.8 


719 


U01139 


Mus musculus 
B6D2F1 clone 
2C11B mRNA. 


e-110 


GSCDROME 


HOMEOBOX 

PROTEIN 

GOOSECOID 


7.2 


720 


ABO 17430 

1 

i 

< 


Homo sapiens 
mRNA for 
Icinesin-like DNA 
binding protein, 
complete cds 


0 


YBAV_ECOLI 

] 
] 
] 


HYPOTHETICAL 
1 2 7 K n PROTFIW 

IN HUPB-COF 
INTERGENIC 
REGION 


0.17 


721 


U95094 ] 

r 
c 


Kenopus laevis 
KL-INCENP 
XL-INCENP) 
nRNA, complete 
;ds 


0.001 


CPCF_SYNP2 1 

1 
< 


J H YCOC YANOBIL 
N LYASE BETA 
3UBUNIT(EC4.-.-.- 


2.4 


722 


U95102 : 
r 

F 

S 
c 


<enopus laevis 
nitotic 

phosphoprotein 
►0 mRNA, 
omplete cds 


9.00E-10 


<NONE> 


<NONE> 


<NONE> 
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Table 2 



x,^ c t ^iohhor t BlastN v s. GenbankJ [Nearest Neighbo r (BlastX vs. Non-Kedunda nt Proteins) 
^ P VALUE ACCESSION DESCRIPTION P VALUE 



I SEQ | ACCESSION 
ID 



DESCRIPTION 



~723~I U95102 



[Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
Icomplete cds 



"7241 U95094 



I Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



0.04 



YKK7_CAEEL 



[HYPOTHETICAL 
54.9 KD PROTEIN 
C02F5.7 IN 
CHROMOSOME III 



8.00E-08 



H5 CAIMO IHISTONEH5 



0.057 



0.39 



"7251 U95094 



jXenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



3.00E-09 DED1_YEAST 



1PUTATIVEATP- 
DEPENDENTRNA 
HELICASE DED1 



7261 J04617 



iHuman elongation 
factor EF-1 -alpha 
gene, complete 
cds. > :: 
dbj|E02629|E0262 
9DNA of human 
polypeptide chain 
elongation factor- 
1 alpha 



5.00E-36 ALU7_HUMAN 



TilMALU 
SUBFAMILY SQ 
WARNING ENTRY 

Mitt 



"7271 X54859 



(Porcine TNF- 
alpha and TNF- 
beta genes for 
tumour necrosis 
factors alpha and 
beta, respectively 



-33 I Z165 HUMAN IZINC FINUER 

PROTEIN 165 



7281 D49911 



TThermus 
thermophilus 
UvrA gene, 
(complete cds 



"0014 I CC48 CAPAN CELL D1VISIUN 

CYCLE PROTEIN 
48 HOMOLOG 



0.84 



5.6 



9.9 



"729| U95098 



jXenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



200E-06 I CA25_HUMAN PROCOLLAUtN 

ALPHA 2(V) 
CHAIN 
PRECURSOR 



730 1 D15057 



IHuman mRNA 
for DAD-1, 
jcomplete cds 



DAD 1 _HUM AN DEFENDLK 

AGAINST CELL 
[DEATH 1 (DAD-1) 



"731| U95098 



IXenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



6 00E-06 I ANFD_RHOCA NITROCibNAbU 

IRON-IRON 
PROTEIN ALPHA 
CHAIN (EC 
1.18.6.1) 

(NITROGENASE 
COMPONENT 1) 
KDIN1TROGENASE 



9.6 
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Table 2 



SEQ 
ID 



732 



733 



Nearest Neighbor (BlastN vs. Genbank) 



ACCESSION 



734 



735 



U95098 



DESCRIPTION 



Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



ABO 1 8335 Homo sapiens 



P VALUE 



7.00E-07 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 



ACCESSION 



DESCRIPTION 



EFTU CHLVI 



ELONGATION 
FACTOR TU (EF- 
TU) 



X98743 



U95098 



KIAA0792 
protein, complete 
cds 



H.sapiens mRNA 
for RNA helicase 
(Myc-regulated 
dead box protein) 



Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



0.04 



2.00E-07 



TRYM_RAT (MAST CELL 

TKYPTASE 
PRECURSOR (EC 
3.4.21.59) 



<NONE> 



<NONE> 



P VALUE 



2.5 



5.6 



<NONE> I <NONE> 



<NONE> I <NONE> 



737 



738 



D 12646 



J04038 



S.cerevisiae 
chromosome X 
reading frame 
ORF YJL039c 



3.2 



<NONE> 



<NONE> 



Mouse kif4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds 



KIF4 MOUSE 



KINESIN-LIKE 
PROTEIN KIF4 



Human 

gIyceraIdehyde-3- 

phosphate 

dehydrogenase 



2.00E-47 



SDC1 HUMAN 



SYNDECAN-1 

PRECURSOR 

(SYND1)(CDI38) 



<NONE> 



2.00E-76 



3.5 



AFO 10238 



Homo sapiens 
von Hippel- 
Lindau tumor 
suppressor 



1.00E-09 



LIN1 HUMAN 



LINE-I REVERSE 
TRANSCRIPTASE 
HOMOLOG 



0.001 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 



2.00E-06 



YQJX_BACSU 



HYPOTHETICAL 
3.2 KD PROTEIN 
IN GLNQ-ANSR 
INTERGENIC 
REGION 



9.9 
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Table 2 



Nearest N eighbor (BlastN vs. Genbank) 

DESCRIPTION | P VALUE 



SEQ I ACCESSION 
ID 



[Nearest Neighbo r (BlastX vs. Non-Redundant Proteins")! 
1 — ' DESCRIPTION I P VALUE 1 



ACCESSION 



741 



L21186 



(Human lysyl 
pxidase-like 
protein mRNA, 
1 complete cds. 



-145 



OXRTL 



|<NONE> 



1.00E-34 



"7421 U95094 



743 AF0092UJ 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



[Homo sapiens 

YAC clone 

377A1 unknown 

mRNA, 

3 'untranslated 

1 region 



2.00E-05 CC48_SOYBN 



3.3 



<NONE> 



TCELL DIVISION 
CYCLE PROTEIN 
48 HOMOLOG 
(VALOSIN 
CONTAINING 
PROTEIN 

1ho molog)(vcp) 

<NONE> 



<NONE> 



"7441 Z74894 



TS.cerevisiae 
chromosome XV 
reading frame 
ORF YOLI52W 



-qJ2 | CD14_RAB1T IQ28680 oryctolagus 

cuniculus (rabbit). 
Imonocyte 
differentiation 
antigen cd!4 
lprecursor. 11/98 



1.9 



"7451 U95094 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



9.00E-10 KIN3„YEAST 



|SERINE/THREONl 
NE-PROTEIN 
KINASE KIN3 (EC 
2.7.1.-) 



2.5 



"7461 U95102 



(Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
[complete cds 



2 00E-05 | YA53 SCHPO IHYPOTHL 1 lCAL 
1 ~ 24.2 KD PROTEIN 

C13A11.03IN 
CHROMOSOME I 



7.00E-17 



"7471 S61044 



"748T U95094 



ALDH3=aldehyd 
e dehydrogenase 
isozyme 3 
[human, stomach, 
[mRNA Partial, 
,1362 nt] 
|Xenopus laevis 
iXL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



"0 I 'DHAP HUMAN lALDEHYDb I 2.00E-71 

DEHYDROGENAS 
E, DIMERIC NADP- 1 
PREFERRING (EC 
1.2.1.5) (CLASS 3) 



2.00E-08 CA1E_CH1CK 



[COLLAGEN 
ALPHA 1(XIV) 
CHAIN 
PRECURSOR 
l(UNDULIN) 



0.36 



"7491 U95102 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
[complete cds 



7.00E-06 



<NONE> 



<NONE> 



<NONE> 
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Table 2 



Nearest Neighbor (BlastN vs. Genbank) 



Nearest Neighbor (BlastX vs. Non-Re dundant Proteins! 



SEQ 
ID 



750 



L14815 



DESCRIPTION I P VALUE 



Entamoeba 
histolytica HM- 
1 :IMSS galactose- 
specific adhesin 
170kD subunit 
(hgl3) gene, 
complete cds. 



ACCESSION 



0.12 



<NONE> 



DESCRIPTION 



<NONE> 



P VALUE 



<NONE> 



T.thermophila 
gene for snRNA 
U2-2 



1.1 



<NONE> 



<NONE> 



<NONE> 



753 I AB001066 



Mytilus edulis 
mitochondrial 
NADH 

dehydrogenase 
subunit 5 (ND5) 
gene, 3' end; 
INADH 

dehydrogenase 
subunit 6 (ND6) 
Igene, complete 
cds; and 

cytochrome b (cyt 
b), 5* end. 



0.042 



DSC1 HUMAN 



Brown trout 
microsatellite 
DNA sequence 



0.38 



IMB3 HUMAN 



754 



DESMOCOLLIN 
1A/1B 

PRECURSOR 
(DESMOSOMAL 
GLYCOPROTEIN 
2/3) (DG2/DG3) 



2.6 



IlVLKUKTiN BETA-3 | 
SUBUNIT 
(KARYOPHERJN 
BETA-3 SUBUNIT) 



1.2 



AF064787 Lotus japonicus 
rac GTPase 
(activating protein 
1 mRNA, 
[complete cds 
iDictyostelium 



0.51 



<NONE> 



<NONE> 



<NONE> 



755 I U20608 



V56 M778I2 



Idiscoideum 
unknown spore 
germination- 
specific protein- 
like protein, orfl, 
orf2 and orf3 
genes, complete 

[cds 

Rabbit myosin 
heavy chain 
mRNA, complete 
cds. 



0.043 



<NONE> 



1.2 



RBL1 HUMAN 



<NONE> 



<NONE> 



Kt 11NOBLASTOM | 
A-LIKE PROTEIN 1 , 
(107 KD 
RETINOBLASTOM \ 
A-ASSOCIATED 
PROTEIN) (PRB1) 
(PI 07) 



4.9 
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Table 2 



Nearer Neighbor QM-H vs. UfW I N— (« - * » ^ HrW 

DESCRIPTIQN I P VALUE ACCESSION DfcSCKir i lON P VALUE 



| SEQ | ACCESSION 
ID 



"7571 X63789 



T.thermophila 
genes for snRN A 
U5-l,snRNA U5- 



758 D50646 



Mouse mRNA tor] 
SDF2, complete 
cds 



"759 1 L81583 



Homo sapiens 
(subclone 3_g2 
from PI Hll) 
DNA sequence 



"7601 U95102 



761 AF00037U 



"762 1 095098" 



"7631 095102 



764 I ABO 18288 



0.058 



<NONE> 



<NONE> 



<NONE> 



2 00E-27 I PMT3 YEAST DOLICHYL- 

PHOSPHATE- 
MANNOSE-- 
PROTEIN 
MANNOSYLTRAN 
SFERASE 3 (EC 
2.4.1.109) 



3 00E-19 I ALU5 HUMAN !!!! ALU 

1 " SUBFAMILY SC 

WARNING ENTRY 
I f 1 1 1 



0.86 



Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 
Homo sapiens 
polymorphic CA 
dinucleotide 
repeat flanking 
region 



2 00E-06 I SYFA YEAST PHEN YLALAN Y L- 5J 

TRNA 

synthetase 
alpha chain 
[cytoplasmic 

APP1 MOUSE AMYLOID-LIKE I 5T 
PROTEIN 1 
PRECURSOR 
(APLP) 



Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial | 

cds 

Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
c omplete cds 
Homo sapiens 
mRNA for 
K1AA0745 
protein, partial 
cds 



0.002 



<NONE> 



<NONE> 



<NONE> 



765 I AF020282 



PSF HUMAN PTB-ASSOC1A1ED 
SPLICING FACTOR) 
(PSF) 



TC2A CAEBR TRANSPUT AULfc 
ELEMENT TCB2 
TRANSPOSASE 



0.72 



1.5 



Dictyostelium 
discoideum 
DG2033 gene, 
partial cds 



0.38 



PMT2 YEAST IDOLICHYL- 



0.18 



PHOSPHATE- 
MANNOSE- 
PROTEIN 
MANNOSYLTRAN 
SFERASE 2 (EC 
2.4.1.109) 
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Table 2 



Nearest Neighbor (BlastN vs. Genbank) 



Nearest Neighbor (BlastX vs. Non-Redundant Proteins)| 



AFO 17357 



U67599 



DESCRIPTION 



Oryza sativa low 
molecular early 
light-inducible 
protein mRNA, 
complete cds 



Methanococcus 
jannaschii section 
141 of 150 of the 
complete genome 



P VALUE 



ACCESSION 



0.38 



RGS3 HUMAN 



0.13 



<NONE> 



DESCRIPTION 



REGULATOR OF 
G-PROTEIN 
SIGNALLING 3 
(RGS3) (RGP3) 



<NONE> 



P VALUE 



0.23 



<NONE> 



B.taurus 
microsatellite 
DNA INRA153 



0.13 



FAG1 SYNY3 



P73574 

synechocystis sp. 
(strain pec 6803). 3- 
iOxoacyi-[acyi-carrier 
protein] reductase 1 
(ec 1.1.1.100) (3- 
ketoacyl- acyl carrier 
[protein reductase 1). 
11/98 



5.00E-16 



Mus musculus 
synaptojanin 2 
isoform delta 
mRNA, partial 
cds 



0.043 | CA44JKUMAN (COLLAGEN 

ALPHA 4(IV) 
CHAIN 
PRECURSOR 



0.24 



Drosophila 
melanogaster 
mitochondrial 
cytochrome c 
oxidase subunits, 
ATPase6, 7 
tRNAs (Trp, Cys, 
Tyr, Leu(UUR), 
Lys, Asp, Gly) 
genes, and 
unidentified 
reading frames 
A6I, 2 and 3. 



0.021 



NU1M CITLA 



NADH- 
UBIQUINONE 
OXIDOREDUCTAS \ 
E CHAIN 1 (EC 
1.6.5.3) 



7.2 



772 ] U95094 



Human DNA 
sequence from 
clone 140L1 on 
hromosome 
22ql3.M3.31, 
complete 
sequence [Homo 
sapiens] 



3.00E-41 I ALU7_HUMAN 



II!!! ALU 
SUBFAMILY SQ 
WARNING ENTRY j 



4.00E-08 



inn 



Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 



1.00E-09 



<NONE> 



<NONE> 



<NONE> 
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Table 2 





Nearest Neighbor (BlastN vs. Genbank) 1 


Nearest Neighbor (BlastX vs. Non-Redunda 


nt Proteins) 




SEQ a 
ID 


\CCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 




773 


AF095927 

\ 
I 
i 
< 


lattus norvegicus 
jrotein 

phosphatase 2C 
nRNA, complete 
;ds 


0 


P2C PARTE ] 
] 


PROTEIN 

PHOSPHATASE 2C 
[EC 3.1.3.16) (PP2C) 


1.00E-16 




774 


X87212 1 

1 


hl.sapiens mRNA 
for cathepsin C 


0 


CATC HUMAN J 


DIPEPTIDYL- 
PEPTIDASE I 
PRECURSOR (EC 
3.4.14.1) 


2.00E-46 




775 


X05283 


Drosophila 
melanogaster 
PKCG7 gene 
exons 7-14 for 
protein kinase C 


4.5 


<NONE> 


<NONE> 


<NONE> 




776 


X03558 


Human mRNA 

for pi on cation 

factor 1 alpha 
subunit 


0 


EF1 1_HUMAN 


ELONGATION 
FACTOR 1 -ALPHA 
1 (EF-l-ALPHA-1) 


1.00E-83 


777 


X06960 


Aspergillus 
nidulans 

mitnrVinndrial 

DNA for 
cytochrome 
oxidase subunit 3, 
tRNA-Tyr 


0.23 


<NONE> 


<NONE> 


<NONE> 


/ /o 




XenoDus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


3.00E-09 


YMT8_YEAST 


HYPOTHETICAL 
36.4 KD PROTEIN 
IN NUP116-FAR3 
INTERGENIC 
REGION 


5.00E-07 


779 


U95102 


Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-07 


NAT1_YEAST 


N-TERMINAL 
ACETYLTRANSFE 
RASE 1 (EC 
2.3.1.88) 


5.00E-23 


780 


U59706 


Gallus gallus 
alternatively 
spliced AMPA 
glutamate 
receptor, isoform 
GluR2 flop, 
(GluR2) mRNA, 
partial cds. 


0.014 


PPOL_SARPE 


POLY (ADP- 
RIBOSE) 

POLYMERASE (EC 
2.4.2.30) (PARP) 


0.021 


781 


U57391 


Rattus norvegicus 
FceRI gamma- 
chain interacting 
protein SH2-B 
(SH2-B) mRNA, 
complete cds 


1.00E-84 


<NONE> 


<NONE> 


<NONE> 
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Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEC 
ID 


> ACCESSION | DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


782 


AB014591 (Homo sapiens 
mRNA for 
KIAA0691 
protein, complete 
cds 


7.00E-57 


SSGP_VOLCA 


SULFATED 
SURFACE 
GLYCOPROTEIN 
185 (SSG 185) 


5.3 


783 


AJ008065 Chrysolina bankii 
16S rRNA gene, 
mitotype B2 


0.043 


<NONE> 


<NONE> 


<NONE> 


784 


AF067212 Caenorhabditis 
elegans cosmid 
F37F2 


0.005 


MEK1 RAT 


KINASE KINASE 1 
(EC 2.7.1.-) (MEK 
KINASE 1) 




785 


U95094 Uenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
Icds 


0.042 


<NONE> 






786 


U95102 Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


9 00E-09 




< lNUiNli> 


<NONE> 


787 


Y13401 Homo sapiens 
CD3 delta gene, 
enhancer 
sequence 


8.00E-08 


<NONE> 


<NONE> 


<NONE> 


788 


AE001038 Archaeoglobus 
fulgidus section 
69 of 172 of the 
complete genome 


0.13 


<NONP> 




<NONE> 


789 


U95102 Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


2.00E-06 


<NONE> 


<NONE> 


<NONE> 


790 


AF041463 Manihot esculenta 
elongation factor 
|l -alpha 


1.4 


<NONE> 


<NONE> 


<NONE> 


791 


U95102 Xenopus laevis 
mitotic 

phosphoprotein 
90 mRNA, 
complete cds 


0.002 


HXA3_HAEIN 1 


UEME.HEMOPEXI 
^-BINDING 
*ROTEIN 
'RECURSOR 


2.7 


792 


Z12112 pWE15A cosmid 
vector DNA 


3.00E-29 


PKWA_THECU I 
J 

¥ 
( 


'UTATIVE 
5ERINE/THREONI 
4E-PROTEIN 
CINASE PKWA 
EC 2.7.1.-) 


2.00E-04 
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Table 2 



Nearest Neighb or (BlastN vs. Genbank) [Nearest ^Neighbor (B lastX vs. Non-Keaunaant P rot^eins) | 



SEQ | ACCESSION 
ID 



DESCRIPTION I P VALUE I ACCESSION 



"7931 U85193 



Human nuclear 
factor I-B2 
(NFIB2) mRNA, 
complete cds 



"7941 U89331 



Human 

pseudoautosomal 
homeodomain- 
containing protein | 
(PHOG) mRNA, 
complete cds 



795 1 AF055666 



Mus musculus 
kinesin light chain] 

(Klc2) mRNA, 
complete cds 



7961 L13321 



Homo sapiens 
iduronate-2- 
sulfatase (IDS) 
gene, exon 1, 
incomplete 5' end. | 



4.00E-44 



<NONE> 



7.00E-06 NRLHUMAN 



NEURAL RETINA- 
SPECIFIC 
LEUCINE ZIPPER 
PROTEIN (NRL) 



0.52 



PSPD BOVIN 



0.14 



YRP2 YEAST 



<NONE> 



PULMONARY 
SURFACTANT- 
ASSOCIATED 
PROTEIN D 
PRECURSOR 



HYPOTHETICAL 
84.4 KD PROTEIN 
IN RPC2/RET1 
3'REGION 



<NONE> 



6.3 



0.33 



0.27 



797 I ALO 102 /0 



Plasmodium 
falciparum DNA 

SEQUENCING 
IN PROGRESS 
*** from contig 
4-96, complete 
sequence 



0.37 



YTH3 CAEEL 



HYPOTHETICAL 
75.5 KD PROTEIN 
C14A4.3 IN 
CHROMOSOME II 



"7981 U95098 



Xenopus laevis 
mitotic 

phosphoprotein 
44 mRNA, partial 
cds 



0.015 



IMB3 HUMAN 



IMPORTIN BETA 
SUBUNIT 
(KARYOPHERIN 
BETA-3 SUBUNIT) 



0.063 



"7991 U70139 



Mus musculus 
putative CCR4 
protein mRNA, 
partial cds 



CCR4 YEAST 



GLUCOSE- 
REPRESSIBLE 
ALCOHOL 
DEHYDROGENAS 
E 

TRANSCRIPTIONA 
L EFFECTOR 
(CARBON 
CATABOLITE 
REPRESSOR 
PROTEIN 4) 



"800 | L26507 



Mouse myocyte 
nuclear factor 
(MNF) mRNA, 
complete cds. 



3.00E-41 MNF_MOUSE 



MYOCYTE 
NUCLEAR 
FACTOR (MNF) 
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Table 2 





Nearest Ne 


ighbor(BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins! 


SEC 
ID 


► ACCESSIOh 


•J DESCRIPTION 


I PVALUI 


i ACCESSION 


DESCRIPTION 


P VALUE 


80 


1 U20527 


Mus musculus 
chemokine KC 
gene, 5* region. 


0 


GRO_MOUSE 


GROWTH REGULATED 
PROTEIN PRECURSOR 
(PLATELET-DERIVED 
GROWTH FAPTOD 

INDUCIBLE PROTEIN 
KC) (SECRETORY 
PROTEIN N51) 


> 1.00E-28 


802 


. AF065482 


sorting nexin 2 
(SNX2) mRNA, 
complete cds 


0 


MYSA__DROME 


MYOSIN HEAVY 
CHAIN, MUSCLE 


0.089 


803 


U05823 


Mus musculus 
pericentrin 
mRNA, 
complete cds. 


1.00E-94 


M84DDROME 


MALE SPECIFIC 
SPERM PROTEIN 

k 4PTO A r\p» 

MST84DD 


0.099 


804 


U67468 


Methanococcus 

ifinnaQfhii 

j cu u lad^i i 1 1 

section 10 of 150 
of the complete 
genome 


0.4 


<NONE> 


<NONE> 


<NONE> 


805 


U14178 


Human type II 
IL-1 receptor 
gene, exon IB 


1.00E-19 


AMPH_HUMAN 


AMPHIPHYSIN 


2.9 


806 


L40411 


Homo sapiens 
thyroid receptor 
interactor 


0 


TRI8HUMAN 


THYROID RECEPTOR 
INTERACTING 
PROTEIN 8 (TRIP8) 


4.00E-86 


807 


D17218 

i 
1 


Human HepG2 
3* region Mbol 
cDNA, clone 
hmd3g02m3 


e-136 


C A 1 A_HUMAN 

. 


COLLAGEN ALPHA 
1(X) CHAIN 
rKtL UKb UK 


3.00E-04 


808 


Z57610 ] 
1 

l 
c 


H .sapiens CpG 
0NA, clone 
I87al0, reverse 
ead 

;pgl87al0.rtla. 


e-102 


HN3B_MOUSE ] 
] 
I 


HEPATOCYTE 
NUCLEAR FACTOR 3- 
3ETA (HNF-3B) 


1.00E-24 


809 


D 14678 I 
f 
r 

P 


iuman mRNA 
or kinesin- 
elated protein, 
martial cds 


0 


NCD_DROME ( 
S 
F 


:laret 

iEGREGATIONAL 
•ROTEIN 


1.00E-70 
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Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant 


Proteins) 


SEQ j 
ID 


\CCESSION 


DESCRIPTION J 


» VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


810 


X56317 : 
i 

( 
i 
t 


>Ciphophorus 
nacuiaiub 
Kmrk(proto- 
>ncogene) gene 
For receptor 
yrosine kinase. 


0.49 


WN1B MOUSE 1 
1 


AWT- 1 0B PROTEIN 
PRECURSOR (WNT-12) 


7.2 


811 


M36200 1 


Human 

synaptobrevin 1 
(SYBl)gene, 
sxon 5 . 


0.2 


VE2 HPV14 J 

] 


REGULATORY 
PROTEIN E2 


3.1 


812 


M18157 


Human glandular 
kallikrein gene, 

hAJIIipidC vua. 


1.5 


EKLF_MOUSE 


ERYTHROID 
KRUEPPEL-LIKE 
TRANSCRIPTION 
FACTOR (EKLF) 


1.1 


813 


D25215 


Human mRNA 
for KIAA0032 
gene, complete 
cds 


1.9 


YXIS_SACER 


HYPOTHETICAL 28.9 
KD PROTEIN IN XIS 
5'REGION(ORFl) 


1.3 


814 


M96628 


Human gene 
sequence, 5* end. 


z.UUJb-UO 




AGR1N (FRAGMENT) 


9.5 


815 


Z57610 


H. sapiens CpO 
DNA, clone 
187al0, reverse 
read 

cpgl87al0.rtla. 


» i no 
e-lU-rfi 


VTM^R MOUSE 


HEPATOCYTE 
NUCLEAR FACTOR 3- 
BETA (HNF-3B) 


1.00E-19 


816 


X14168 


Human pLC46 

with TYN A 
Willi urir\. 

replication origin 


5.00E-16 


ZN44_HUMAN 


ZINC FINGER PROTEIN 
44 (ZINC FINGER 
PROTEIN KOX7) 


1.6 


817 


Ml 9262 


Rat clathrin light 
chain (LCB3) 
mRNA, 
complete cds. 


0.28 


LMAJ3ROME 


LAMININ ALPHA 
CHAIN PRECURSOR 


4.7 


818 


AF058055 


Mus musculus 
monocarboxylate 
transporter 1 


0.2 


<NONE> 


<NONE> 


<NONE> 


815 


1 ABO 14570 


Homo sapiens 
mRNA for 
KIAA0670 
protein, partial 
cds 


0.16 


YGR1_YEAST 


HYPOTHETICAL 34.8 
KD PROTEIN IN SUT1- 
RCK1 INTERGENIC 
REGION 


4.00E-06 


82( 


) M19262 


Rat clathrin light 
chain (LCB3) 
mRNA, 
complete cds. 


0.27 


LMA_DROME 


LAMININ ALPHA 
CHAIN PRECURSOR 


4.5 
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Table 2 





Nearest Ne 


ighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins} 


SEQ 
ID 


> ACCESSION 


4 DESCRIPTIOTS 


1 PVALUi 


i ACCESSION 


DESCRIPTION 


P VALUE 


82 


1 Z54367 


for plectin 




i yjy 3JLAEEL 


HYPOTHETICAL 58.5 
KD PROTEIN T20B 12.3 
IN CHROMOSOME III 


1.00E-14 


82: 


I ABO 17026 


Mus musculus 
mRNA for 
oxysterol- 
binding protein, 
complete cds 


0 


OXYB_HUMAN 


f OXYSTEROL-BINDING 
PROTEIN 


2.00E-49 


— 823 


X58170 


M.musculus 
mRNA for t- 
Complex Tcp- 
10a gene 


1.00E-20 


UL52_HSV1 1 


DNA 

HELICASE/PRIMASE 
COMPLEX PROTEIN 
(DNA REPLICATION 
PkuiEINUL52) 


5.3 


824 


X58430 


Human Hoxl.8 
gene 


0 


HXAAJ-fUMAN 


HOMEOBOX PROTEIN 

HOX-AIO(HOX-IH) 

(HOX-1.8)(PL) 


1.00E-44 


825 


X53754 


Porcine 

sarcoplasmic/end 
oplasmic- 
reticulum Ca(2+) 
pump gene 2 3'- 
end region 


1.3 


<NONE> 


<NONE> 


<NONE> 


826 


AB005786 


Arabidopsis 
thaliana tRNA- 
Glu gene 


0.46 


<NONE> 


<NONE> 


<NONE> 


827 


AB012130 


Homo sapiens 
SBC2 mRNA for 
sodium 
bicarbonate 
cotransporter2, 
complete cds 


1.9 


<NONE> 


<NONE> 


<NONE> 


828 


AB017430 

] 

1 
( 


Homo sapiens 
mRNA for 
cinesin-like 
DNA binding 
protein, complete 
:ds 


0 


YBAV_ECOLI 

] 


HYPOTHETICAL 12.7 
KD PROTEIN IN HUPB- 

LUr JIN 1 E-KOUfNlC 

REGION 


0.063 


829 


AB007886 ] 
1 
i 

r 
\ 


Hiomo sapiens 
CIAA0426 
nRNA, 
;omplete cds 


0.042 


YDF3_SCHPO 1 
1 
I 
( 


PROBABLE 
iUKARYOTIC 
NITIATION FACTOR 
:i7C9.03 


0.52 


830 


AB018335 i 
r 
I 

F 
c 


-lomo sapiens 
nRNA for 
CIAA0792 
>rotein, complete 
ds 


e-172 


UROT_BOVIN 1 
/ 
I 
3 


["ISSUE PLASMINOGEN 
ACTIVATOR 
>RECURSOR(EC 
.4.21.68) 


0.86 
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Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


831 


D 12646 


Mouse kif4 
mRNA for 
microtubule- 
based motor 
protein KIF4, 
complete cds 


0 


KIF4_MOUSE 


KINESIN-LIKE 
PROTEIN KIF4 


9.00E-96 


832 


U38376 


Rattus 
norvegicus 
cytosolic 
phosphoiipase 
A2 mRNA, 
complete cds 


0.048 


<NONE> 


<NONE> 


<NONE> 


833 


L40411 


Homo sapiens 
thyroid receptor 
interactor 


0 


TR18_HUMAN 


THYROID RECEPTOR 
INTERACTING 

PPOTPIM R fTRIPRI 
ri\w i e»iln o ^ i t\iroj 


4.00E-86 


834 


U08110 


Mus musculus 
RNA1 homolog 
(Fugl)mRNA, 
complete cds. 


8.00E-04 


YNW7_YEAST 


HYPOTHETICAL 68.8 
KD PROTEIN IN URE2- 
SSU72 INTERGENIC 
REGION 


0.02 


835 


D50646 


Mouse mRNA 
for SDF2, 
complete cds 


1.00E-40 


YB64_YEAST 


HYPOTHETICAL 57.2 
KD PROTEIN IN MET8- 
HPC2 INTERGENIC 
REGION 


4.9 


836 


D50646 


Mouse mRNA 
for SDF2, 
complete cds 


1.00E-40 


YB64YEAST 


HYPOTHETICAL 57.2 
KD PROTEIN IN MET8- 
HPC2 INTERGENIC 
REGION 


4.9 


837 


U67459 


Methanococcus 
jannaschii 
section 1 of 150 
of the complete 
genome 


5.00E-05 


GCS1_HUMAN 


MANNOSYL- 
OLIGOSACCHARIDE 
GLUCOSIDASE(EC 
3.2.1.106) 


9.2 


838 


U18657 


Haemophilus 
influenzae LeuA 
(leuA) gene, 
partial cds, DprA 
(dprA+), orf272 
and orf!93 
genes, complete 
cds, and PfkA 
(pfkA) gene, 
partial cds. 


0.01 


STE6_YEAST 


MATING FACTOR A 
SECRETION PROTEIN 
STE6 (MULTIPLE 
DRUG RESISTANCE 
PROTEIN HOMOLOG) 
(P-GLYCOPROTEIN) 


7 
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Table 2 





Nearest Neighbor (BlastN vs. Genbank) 


Nearest Neighbor (BlastX vs. Non-Redundant Proteins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


839 


U 12523 


Rattus 
norvegicus 
ultraviolet B 
radiation- 
activated UV98 
mRNA, partial 
sequence. 


1.00E-10 


YMT8_YEAST 


HYPOTHETICAL 36.4 
KD PROTEIN IN 
NUP116-FAR3 
INTERGENIC REGION 


2.00E-06 


840 


D78255 


Mouse mRNA 
for PAP- 1, 
complete cds 


e-175 


<NONE> 


<NONE> 


<NONE> 


841 


D 17263 


Human HepG2 
3' region Mbol 

rtT^XI A /> 1 S^r\t± 
K/JWk ^ I K 7 V1VIIV 

hmd5fD7m3 


1.00E-58 


<NONE> 


<NONE> 


<NONE> 


842 


AF006751 


Homo sapiens 
ES/130 mRNA, 
complete cds 


0.061 


YRP2YEAST 


HYPOTHETICAL 84.4 
KD PROTEIN IN 
RPC2/RET1 3'REGION 


2.00E-07 


843 


U67459 


Methanococcus 
jannaschii 
section 1 of 150 
of the complete 
genome 


6.00E-05 


YC14_METJA 


HYPOTHETICAL 
PROTEIN MJ 12 14 


8.1 


844 


D88689 


Mus musculus 
mRNA for fit- 1, 
complete cds 


0.084 


ICP0_HSV2H 


TRANS-ACTING 
TRANSCRIPTIONAL 
PROTEIN ICP0 
(VMW1 18 PROTEIN) 


0.014 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



IUU V ilMlllC 


Cluster 


Clones in 


Clones in 


Clones 


ID 


Libl 


Lib2 


Lib3 


M00001340B:A06 


17062 


3 


0 


0 


M00001340D:F10 


11589 


2 


2 


1 


M00001341A:E12 


4443 


10 


6 


2 


M00001342B:E06 


39805 


2 


0 


0 


M00001343C:F10 


2790 


7 


15 


13 


M00001343D:H07 


23255 


3 


0 


1 


M00001345A:E01 


6420 


8 


0 


2 


M00001346A:F09 


5007 


4 


8 


3 


M0000 1 346D:E03 


6806 


5 


2 


1 


M0000 1 346D*G06 


5779 


5 


4 


3 


MO000 1 346DG06 


5779 


5 


4 


3 


MOO00 1 347 A'B 1 0 


13576 


5 


0 


0 


MOOOO 1 348B-B04 


16927 


4 


0 


0 


MO0O01348B'G06 

1 V1V7 W W V/ 1 J*TOJJ«VJ Vw 


16985 


4 


0 


0 


M00001 349B*B08 


3584 


5 


11 


5 


MOOOOI 350A*H01 


7187 


5 


3 


1 


MOOOOI ^SIR-AOS 


3162 


10 


14 


1 


MOOOOI 35 IB* A08 


3162 


10 


14 


1 


M00001352A*E02 


16245 


4 


0 


0 


MO000 1 353A:G 12 


8078 


4 


3 


1 


MOOOOI 353D-D10 


14929 


4 


0 


0 


MOOOO 1355B*G 10 


14391 


3 


1 


0 


M0000 1 357D-D 1 1 


4059 


8 


6 


8 


MO0O0 1 36 1 A: AOS 


4141 


5 


2 


10 


M00001361D:F08 


2379 


26 


13 


4 


MOOOOI 362B:D10 


5622 


7 


4 


2 


M00001362C:H11 


945 


9 


21 


2 


M00001365C:C10 


40132 


2 


0 


0 


MOOOOI 370A:C09 


6867 


7 


3 


0 


MOOOOI 37 1C:E09 


7172 


3 


5 


1 


MOOOOI 376B:G06 


17732 


1 


3 


5 


MOOOOI 378B:B02 


39833 


2 


0 


0 


MOOOOI 379A:A05 


1334 


27 


38 


35 


MOOOOI 380D:B09 


39886 


2 


0 


0 


MOOOOI 382C:A02 


22979 


2 


1 


0 


M00001383A:C03 


39648 


2 


0 


0 


MOOOOI 383 A:C03 


39648 


2 


0 


0 


MOOOOI 386C:B 12 


5178 


5 


5 


4 


M00001387A:C05 


2464 


5 


19 


25 


MOOOOI 387B:G03 


7587 


6 


2 


1 


M00001388D:G05 


5832 


10 


3 


0 


M00001389A:C08 


16269 


3 


0 


0 


MOOOOI 394A.F01 


6583 


2 


7 


3 


MOOOOI 395A:C03 


4016 


5 


14 


0 


M00001396A:C03 


4009 


6 


4 


13 


MOOOOI 402 A:E08 


39563 


2 


0 


0 
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Clones in Clones in Clones in 



Lib4 


Lib8 


Lib9 


0 


0 


0 


3 


3 


8 


6 


3 


11 


0 


1 


0 


14 


6 


0 


1 


0 


0 


0 


1 


0 


6 


2 


6 


2 


0 


3 


4 


0 


0 


4 


0 


0 


0 


12 


11 


2 


0 


0 


0 


0 


0 


0 


0 


2 


0 


1 


0 


6 


6 


5 


6 


6 


5 


0 


0 


0 


0 


1 


0 


1 


23 


16 


0 


0 


0 


16 


0 


1 


16 


4 


27 


2 


2 


3 


13 


1 


2 


1 


0 


0 


0 


3 


0 


0 


0 


0 


2 


0 


1 


0 


1 


4 


0 


0 


0 


28 


3 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


2 


5 


2 


16 


1 


0 


0 


0 


0 


1 


5 


0 


0 


1 


1 


2 


0 


0 


6 


0 


0 


5 


4 


10 


0 


0 


0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 


Cluster 


Clones in 


Clones 




ID 


Libl 


Lib2 


M00001407B:D11 


5556 


8 


1 


M00001409C:D12 


9577 


5 


2 


M00001410A:D07 


7005 


8 


2 


M00001412B:B10 


8551 


4 


4 


M00001415A:H06 


13538 


5 


0 


M00001416A:H01 


7674 


5 


2 


M00001416B:H11 


8847 


4 


1 


M00001417A:E02 


36393 


2 


0 


M00001418B:F03 


9952 


4 


2 


M00001418D:B06 


8526 


3 


2 


M00001421C:F01 


9577 


5 


2 


M00001423B:E07 


15066 


4 


0 


M00001424B:G09 


10470 


5 


1 


M0000 ! 4253:H0S 


22195 


3 


n 


M00001426D:C08 


4261 


4 


9 


M000O1428A:H10 


84182 


1 


0 


M00001429A:H04 


2797 


15 


11 


M00001429B:A11 


4635 


7 


9 


M00001429D:D07 


40392 


2 


0 


M00001439C:F08 


40054 


1 


0 


M00001442C:D07 


16731 


3 


1 


M00001445A:F05 


13532 


3 


2 


M00001446A:F05 


7801 


5 


2 


M00001447A:G03 


10717 


7 


2 


M00001448D:C09 


8 


1850 


2127 


M00001448D:H01 


36313 


2 


0 


M00001449A:A12 


5857 


6 


2 


M00001449A:B12 


41633 


1 


1 


M00001449A:D12 


3681 


12 


5 


M00001449A:G10 


36535 


2 


0 


M00001449C:D06 


86110 


1 


0 


MO0OO1450A:A02 


39304 


2 


0 


M00001450A:A11 


32663 


1 


1 


M00001450A:B12 


82498 


1 


0 


M00001450A:D08 


27250 


2 


0 


M00001452A.B04 


84328 


1 


0 


M00001452A:B12 


86859 


1 


0 


M00001452A:D08 


1120 


44 


41 


M00O01452A:F05 


85064 


1 


0 


M00001452C:B06 


16970 


4 


0 


M00001453A:E11 


16130 


3 


1 


M00001453C:F06 


16653 


3 


1 


M00001454A:A09 


83103 


1 


0 


M00001454B:C12 


7005 


8 


2 


M00001454D:G03 


689 


58 


95 


M00001455A:E09 


13238 


4 


1 


M00001455B:E12 


13072 


4 


1 


M00O01455D:F09 


9283 


4 


1 



Clones in Clones in 



Lib3 


Lib4 


5 


0 


0 


1 


0 


0 


0 


3 


0 


0 


o 


« 

j 


3 


0 


0 


1 


1 


1 


1 


5 


0 


1 


0 


0 


0 


2 


f\ 


r\ 


V 


\J 


7 


9 


0 


0 


18 


16 


2 


0 


1 


8 


0 


0 


0 


0 


1 


0 


4 


6 


0 


5 


1703 


3133 


0 


0 


3 


4 


0 


0 


10 


1 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


5 


11 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


17 


36 


0 


0 


0 


0 


0 


1 
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Clones in Clones in 



Lib8 


Lib9 


2 


0 


11 


12 


0 


0 


0 


0 


9 


1 


0 


0 


6 


1 


0 


0 


0 


0 


1 


0 


11 


12 


0 


0 


0 


1 


0 


0 


12 


15 


0 


0 


1 


14 


0 


0 


12 


16 


0 


0 


0 


0 


1 


2 


1 


0 


8 


0 


1355 


122 


1 


30 


0 


0 


0 


0 


2 


5 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


5 


0 


0 


0 


3 


4 


0 


1 


0 


0 


0 


0 


0 


0 


66 


95 


0 


0 


0 


0 


0 


1 
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Table 5 All Differential Data for Libs 1-4 and 8-9 
Clone Name 



Cluster Clones in 

ID Libl 

M00001455D:F09 9283 4 

M00001460A:F06 2448 23 

M00001460A:F12 39498 2 

M00001461A:D06 1531 20 

M00001463C:B11 19 1415 

M00001465A:B11 10145 2 

M00001466A:E07 4275 11 

M00001467A:B07 38759 2 

M00001467A:D04 39508 2 

M00001467A:D08 16283 3 

M00001467A:D08 16283 3 

M00001467A:E10 39442 2 

M00001468A:F05 7589 6 

M00001469A:C10 12081 4 

M00001469A:H12 19105 2 

M00001470A:B10 1037 53 

M00001470A:C04 39425 2 

M00001471A:B01 39478 2 

M00001481D:A05 7985 3 

M00001490B:C04 18699 2 

M00001494D:F06 7206 4 

M00001497A:G02 2623 12 

M00001499B:A11 10539 2 

M00001500A:C05 5336 9 

M00001500A:E11 2623 12 

M00001500C:E04 9443 4 

M00001501D:C02 9685 3 

M00001504C:A07 10185 5 

M00001504C-.H06 6974 7 

M00001504D:G06 6420 8 

M00001507A:H05 39168 2 

M00001511A:H06 39412 2 

M00001512A:A09 39186 2 

M00001512D:G09 3956 9 

M00001513A-.B06 4568 10 

M00001513C:E08 14364 1 

M00001514C:D11 40044 2 

M00001517A-.B07 4313 13 

M00001518C:B11 8952 3 

M00001528A:C04 7337 4 

M00001528A:F09 18957 3 

M00001528B:H04 8358 3 

M00001531A:D01 38085 2 

M00001532B:A06 3990 6 

M00001533A:C11 2428 14 

M00001534A:C04 16921 4 

M00001534A:D09 5097 6 

M00001534A:F09 5321 11 



Clones in 
Lib2 

1 

22 

0 

23 
1203 

0 

2 

0 

0 

0 

0 

0 

2 

0 

0 

48 

0 

0 

1 

1 

3 

4 

1 

2 

4 

2 

2 

1 

3 

0 

0 

0 

0 

9 

4 

0 

0 

6 

4 

4 

0 

3 

0 

12 

14 

0 

5 

7 



Clones in 
Lib3 

0 
2 
0 

32 
1364 
2 
5 
0 
0 
0 
0 
0 
1 
0 
2 
4 
0 
0 
4 
0 
3 
31 

1 
4 
31 

1 

0 

0 

0 

2 

0 

0 

0 

5 

0 

0 

0 

1 

0 

3 

0 

2 

0 

4 

13 

0 
1 
1 
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Clones in Clones in Clones in 



Lib4 


Lib8 


Lib9 


1 


0 


1 


3 


3 


1 


0 


0 


0 


17 


14 


14 


525 


479 


774 


o 


0 


0 


o 


4 


2 


o 


1 


1 


o 


0 


0 


n 


0 


0 


o 


o 


0 


o 

\J 


o 


0 


1 

I 




0 




o 


o 


n 


1 


0 




o 


0 


A 

U 


n 
\j 


o 


A 

u 


0 


o 


o 

vl 


1 


o 


A 
V 


o 


3 


1 
1 


2 


0 


A 
H 




1 


A 
U 




o 


O 
o 


3 


15 


A 


6 


1 


1 
1 


0 


0 


7 


2 


3 


o 


2 


4 


1 

1 


0 


0 


o 


1 


0 


o 


0 


0 


o 


0 


0 


o 


0 


0 


2 


0 


0 


9 


2 


0 


0 


0 


0 


0 


0 


0 


0 


1 


0 


4 


2 


0 


16 


12 


21 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1 


3 


1 


9 


2 


19 


1 


2 


1 


1 


3 


2 


5 


10 


26 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name Cluster Clones in 

ID Libl 

M00001534C:A01 4119 9 

M00001535A:B01 7665 3 

M00001535A:C06 20212 2 

M00001535A:F10 39423 2 

M00001536A:B07 2696 23 

M00001536A:C08 39392 2 

M00001537A:F12 39420 2 

M0000 1 537B:G07 3389 4 

M00001 54OA:D06 8286 6 

M00001541A:D02 3765 19 

M0000 1 54 1 A:F07 22085 3 

M00001541A:H03 39174 2 

M00001542A:A09 22113 3 

\A(\(\f\f\t C/IO A .T7A^* *>r\A ~ 

w a. ^-ri.n.ijVU J7tJJ ^ 

M00001544A:E03 12170 2 
M00001544A:G02 19829 2 
M00001544B:B07 6974 7 

M00001545A:C03 19255 2 

M00001545A:D08 13864 3 

M00001546A:G11 1267 43 

M00001548A:E10 5892 5 

M00001548A:H09 1058 40 

M00001549A:B02 4015 10 

M00001549A:D08 10944 3 

M00001549B:F06 4193 12 

M00001549C:E06 16347 4 

M00001550A:A03 7239 5 

M00001550A.G01 5175 8 

M00001551A:B10 6268 6 

M00001551A:F05 39180 2 

M00001551A:G06 22390 2 

M00001551C.G09 3266 12 

M00001552A.-B12 307 73 

M00001552A:D1 1 39458 2 

M00001552B:D04 5708 5 

M00001553A:H06 8298 4 

M00001553B:F12 4573 5 

M00001553D:D10 22814 3 

M00001555A.-B02 39539 2 

M00001555A:C01 39195 2 

M00001555D:G10 4561 8 

M00001556A:C09 9244 2 

M00001556A:F11 1577 12 

M00001556A:H01 15855 2 

M00001556B:C08 4386 7 

M00001556B:G02 11294 4 

M00001557A:D02 7065 5 

M00001557A:D02 7065 5 



Clones in 
Lib2 
4 
1 

0 
0 
11 
0 
0 

11 
1 

6 
0 
0 
0 
0 
1 

0 
3 
0 
0 

55 
1 

44 
5 
0 
7 
0 
2 
1 
4 
0 
1 

14 
60 
0 
4 
3 
7 
0 
0 
0 
4 
0 

40 
1 
8 
0 
3 
3 



Clones in 
Lib3 

2 
5 
1 

0 
9 
0 
0 
13 
0 
0 
0 
0 
0 
O 
2 
1 
0 
0 
2 
5 
4 
37 
8 
3 
2 
0 
1 
3 
3 
0 
0 
0 

196 
0 
4 
1 
2 
0 
0 
0 
4 
3 

25 
1 
3 
2 
2 
2 



Clones in 
Lib4 

2 
0 
1 

0 

18 

0 

0 

2 

3 

0 

0 

0 

0 

0 

0 

0 

1 

0 
1 
0 
4 

47 

15 

1 

2 

0 

0 

2 

18 

0 

0 

1 

75 

0 

3 

3 

5 

0 

0 

0 

8 

2 

3 

2 

1 

0 

1 

1 



Clones in 
Lib8 

5 

0 

0 

0 

10 

0 

0 

0 

4 

0 

0 

0 

0 

0 

0 

0 

0 

0 

2 

0 

1 

39 

2 

0 

0 

0 

2 

0 

5 

0 

0 

0 

79 

0 

1 

0 

0 

0 

1 

0 

0 
10 

4 

12 

3 

0 

0 

0 



Clones in 
Lib9 

3 
0 
0 
0 
21 
0 
0 
0 
0 
0 
1 
0 
0 
0 
0 
0 
0 
0 
4 
0 
3 

59 
0 
7 
1 
0 
0 
0 
0 
0 

1 

6 

27 
0 
4 
0 
1 
0 
0 
0 
0 
17 
0 
213 
21 
1 

0 
0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 



M00001557A:F01 
M00001557A:F03 
M00001557B:H10 
M00001557D:D09 
M00001558B:H11 
M00001560D:F10 
M00001561A:C05 
M00001563B:F06 
M00001564A:B12 
M00001571C:H06 
M00001578B:E04 
M00001579D:C03 
M00001583D:A10 
M00001586C:C05 
M00001587A:B11 
M00001594B:H04 
M00001597C:H02 
M00001597D-.C05 
M00001598A-.G03 
M00001601A:D08 
M00001604A:B10 
M00001604A:F05 
M00001607A:E11 
M00001608A:B03 
M00001608B:E03 
M00001614C:F10 
M00001617C:E02 
M00001619C:F12 
M00001621C:C08 
M00001623D:F10 
M00001624A:B06 
M00001624C:F01 
M00001630B-.H09 
M00001644C:B07 
M00001645A:C12 
M00001648C:A01 
M00001657D:C03 
M00001657D:F08 
M00001662C:A09 
M00001663A:E04 
M00001669B:F02 
M00001670C:H02 
M00001673C:H02 
M00001675A:C09 
M00001676B:F05 
M00001677C:E10 
M00001677D:A07 
M00001678D:F12 



Cluster 
ID 

9635 
39490 
5192 
8761 
7514 
6558 
39486 
102 
5053 
5749 
23001 
6539 
6293 
4623 
39380 
260 
4837 
10470 
16999 
22794 
1399 
39391 
11465 
7802 
22155 
13157 
17004 
40314 
40044 
13913 
3277 
4309 
5214 
39171 
19267 
4665 
23201 
76760 
23218 
35702 
6468 
14367 
7015 
8773 
11460 
14627 
7570 
4416 



Clones in 
Libl 

3 
2 
8 
3 
5 
4 
2 
289 
11 
4 
2 
8 
3 
3 
2 

189 
6 
5 
4 
2 
49 
2 
5 
5 
3 
4 
4 
2 
2 
2 
10 
4 
10 
2 
2 
5 
3 
1 
3 
2 
4 
3 
6 
4 
4 
1 
5 
9 



Clones in 
Lib2 

0 
0 
5 
4 
3 
3 
0 
233 
4 
1 
1 
3 
5 
4 
0 

188 
2 
1 
0 
0 

27 
0 
0 
4 
0 
1 

0 
0 
0 
1 

11 

13 

2 

0 

0 

9 

0 

0 

0 

0 

3 

0 

3 

1 

2 

2 

3 

5 



Clones in 
Lib3 

2 
0 
0 
0 
0 
4 
0 
278 
2 
9 
0 
0 
2 
12 
0 

27 

10 

0 

0 

0 

19 

0 

0 

0 

0 

0 

1 

0 

0 

2 
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3 

2 

0 

0 

0 

0 

2 

0 

0 

3 

0 

1 

4 

0 

1 

0 

2 



Clones in 
Lib4 
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0 
5 
1 
0 
0 
0 

116 
2 
0 
2 
0 
6 
2 
0 
2 
0 
2 
0 
0 

7 

0 

0 

1 

0 

3 

0 

0 

0 

0 

3 

10 

2 

0 

0 

0 

0 

2 

0 

0 

8 

0 

2 

4 

0 

0 

0 

6 



Clones in 
Lib8 

0 
1 

0 
0 
0 
0 
0 

123 

1 

0 

0 

0 

0 

1 

0 

15 

3 

0 

0 

0 

10 

0 

0 

0 

0 

1 

1 

1 

0 

0 

5 

0 

4 

0 

0 

0 

3 

0 

0 

0 

1 

0 

1 

4 

0 

1 

0 

1 



Clones in 
Lib9 

0 
0 
0 

1 

0 
5 
0 

184 
1 
0 
0 
1 

0 
1 

0 

0 

1 

1 

0 

0 
23 

0 

0 

0 

0 

0 

0 

0 

0 
1 
1 

0 
3 
0 
1 
0 
0 
5 
0 
0 
0 
0 

1 

6 
0 
0 
0 
3 
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Table 5 AJ] Differential Data for Libs 1-4 and 8-9 



Clone Name Cluster 
ID 

M00001679A:A06 6660 

M00001679A:F10 26875 

M00001679B:F01 6298 

M00001679C.F01 78091 

M00001679D:D03 10751 

M00001679D:D03 10751 

M0000 1 680D:F08 1 0539 

M00001682C:B12 17055 

M00001686A:E06 4622 

M00001688C:F09 5382 

M00001693C:G01 4393 

M0000 1 7 1 6D:H05 67252 

M00003741D:C09 40108 

— — . . , 41-T/V/ 

M00003759B:BO9 697 
M00003762C:B08 17076 
M00003763A:F06 3108 
M00003774C:A03 67907 
M00003796C:D05 5619 
M00003826B:A06 1 1350 
M00003833A:E05 21877 
M00003837D:A01 7899 
M00003839A:D08 7798 
M00003844C:B11 6539 
M00003846B:D06 6874 
M00003851B.D10 13595 
M00003853A:D04 5619 
M00003853A:F12 10515 
M00003856B:C02 4622 
M00003857A:G10 3389 
M00003857A:H03 4718 
M00003871C:E02 4573 
M00003875B:F04 12977 
M00003875B:F04 12977 
M00003875C:G07 8479 
M00003876D:E12 7798 
M00003879B:C11 5345 
M00003879B:D10 31587 
M00003879D:A02 14507 
M00003885C:A02 13576 
M00003885C:A02 13576 
M00003906C:E10 9285 
M00003907D:A09 39809 
M00003907D:H04 1 63 1 7 
M00003909D:C03 8672 
M00003912B.D01 12532 
M00003914C:F05 3900 
M00003922A:E06 23255 



Clones in 
Libl 

7 

1 

2 

1 
3 
3 
2 
4 
7 
6 
10 
1 



76 

4 

14 

1 

3 

3 

2 

5 

5 

8 

6 

4 

3 

5 

7 

4 

4 

5 

5 

5 

4 

5 

7 

1 

3 

5 

5 

4 

1 

3 

4 

4 

9 

3 



Clones in 
Lib2 

0 
0 
4 
0 
2 
2 
1 

0 
6 
2 
6 
0 
0 
0 

52 

0 

11 

0 

5 

3 

1 

4 

2 

3 

3 

0 

5 

1 

6 

11 

5 

7 

0 

0 

3 

2 

1 

1 

1 

0 
0 
3 
0 
0 
4 
1 

6 
0 



Clones in 
Lib3 
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0 

5 

0 

0 

0 

1 
0 
4 
6 
2 
0 
0 
0 

30 

0 

7 

0 

3 

0 

0 

0 

2 

0 

0 

1 

3 

0 

4 
13 

5 

2 

0 

0 

1 

2 

7 

0 

0 

0 

0 

0 

0 

0 

0 

0 

8 

1 



Clones in 
Lib4 

2 

0 

3 

0 

1 

1 

0 

0 

2 

2 

4 

1 

0 
0 
72 
0 
5 
0 
3 
0 
0 
2 
0 
0 
0 
0 
3 
1 

2 

2 

2 

5 

0 

0 

1 

0 

4 

0 

0 

0 

0 

0 

0 

0 

0 

1 
1 
1 



Clones in 
Lib8 

1 
1 
1 

0 
0 
0 

1 

0 
3 
0 
1 

0 
0 
0 
21 
0 
0 
0 
0 
1 
0 
1 
0 
0 
0 
0 
0 

1 

3 

0 

4 

0 

0 

0 

2 

0 

6 

1 

3 

12 

12 

1 

2 

0 

0 

0 

7 

0 



Clones 
Lib9 

0 
0 
0 
0 

1 

1 

0 
0 
0 
3 
1 
0 
0 
0 

30 

0 

1 

0 
4 
0 
1 
0 

1 
1 

0 

1 

4 
2 
0 
0 
6 
1 
0 
0 
4 
1 

27 
0 
1 

11 
11 
2 
1 

0 
0 
1 

13 
0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Vwiones 




ID 


LrlDl 




I ihl 






Lih9 


M00003y3oA :HUz 


1 oVD / 


J 


A 


n 


o 


o 


o 




1 07J / 


1 

J 


0 


o 

V 


o 


o 


o 


Ik Jf AAAAO ACO/™ 1 ./"! t A 

M000U39-)oL,.vj1U 


/1A/ICC 

4U4jj 


o 

z 


A 
U 




o 


o 


o 


XjfrtrtrtA'305Cr , «/^ 1 A 

MUUDOiVjoCU IU 


yjA/ICC 

4U4DD 


o 
z 


A 
U 


n 


0 


o 


o 


x >r aaaai a^cqd .rn^ 


Z4455 


o 

z 


A 

V 


i 


4 


o 


o 


X vf AAAAIQOA/*" 1 .DAQ 


4U 1ZZ 


o 

z 


A 
V 


n 


o 


o 


o 


X A AAAA*? A*7>ll"\.CAO 

M000039 /4D.h,U / 


Oil 1 A 


-5 


A 
U 


A 


o 


o 


o 


M00003974D.H0Z 


ill <fi 


1 
J 


A 
U 


A 
1/ 


o 


1 


o 


M00003975A:Ol 1 


1 o>i io 


4 


A 
U 


A 


o 


o 


o 


X A AAAAO OTOD ./"'AC 


Doy3 


/ 


A 
4 


1 

1 


-l 

J 


1 


1 


M00003981A:E10 


343U 


A 


J u 


7 

/ 




o 

\J 


o 


M00003982C:C02 


2433 


1 A 
I 0 


l J 


0 1 
Z 1 


1 o 


Q 

o 


o 


M00003983A:A05 


A 1 A C 

9105 


5 


1 


i 
1 


1 
1 


A 

u 


n 


M00004028D:A06 


6124 


4 


8 


1 


Q 


i 
i 


A 
VI 


M00004028D:C05 


A AAT1 

40073 


^% 
z 


A 
U 


i 
1 


A 
\) 


A 
V/ 


1 
1 


M00004031A:A12 


9061 


5 


Z 


A 


A 
LI 


A 
U 


V/ 


m ay a a. a a. a a A *i A a i a 

M0000403 1 A: A 1 2 


9061 


5 


z 


A 


A 
U 


A 

u 


A 
U 


M00004035C:A07 


37285 


2 


A 

0 


A 
U 


1 
1 


A 
V 


1 
1 


M00004035D:B06 


17036 


4 


0 


A 
U 


A 
U 


A 
U 


A 
VI 


M00004059A:D06 


Fin 

5417 


1 A 

10 


4 


A 
U 


Q 

y 


O 

Z 


A 
VI 


M00004068B : AO 1 


3706 


7 


1 A 

14 


4 


zz 


1 
1 


A 
VI 


X iAAAA Jl AT^Ti .1) AC 

M00004072B :B05 


17036 


4 


A 


A 

U 


A 
V 


A 

Vr 




M00004081C:D10 


15069 


3 


0 


A 

u 


1 
1 


A 
U 


A 
VI 


~m * A A A A j| A A 4 _ T^X 1 ^ 

M00004081C:D12 


1 A *5 A1 

14391 


3 


1 


A 


A 

u 


A 
U 




M00004086D:G06 


9285 


4 


3 


A 
U 


A 
VI 


1 
1 


o 


% A A A A A A ^\ f|^T\ _ A 1 

M00004087D: AO I 


Z" O O A 

6880 


2 


O 


1 
1 


1 
1 


A 
VI 


o 


M00004093D:B 1 2 


5325 


5 


c 
3 


o 
z 


A 
Vl 


O 


] 


M00004093D:B 1 2 


5325 


5 




o 

z 


A 
VI 


o 


1 


M00004 1 05C:A04 


TOO 1 

7221 


c 

D 


Z 


o 

Z 


o 

z 


A 

V 


o 


M00004 1 08 A :E06 


4937 


A 

4 


A 

y 




1 

i 






X At\f\f\f\A 111 "P\. A AO 

M000041 1 IJDrAUo 


Oo /4 


0 




A 
U 


n 

V 


o 

V 


o 


X A AAAA/1 1 1 A 1 1 

M000U41 14C:r 1 1 


1 J 1 53 


o 

Z 


J 


A 


7 


o 


1 


XyfAAAA/l 1 ">OT>-T I AO 

M00004 1 3 o JtS .HUZ 


1 JZ /Z 


o 
j 


O 

z 


A 
V 


■j 


o 


o 


\>f AAAA/t 1 yf AC*'^ 1 1 

JvlUUvlU4 14v)L*.l-, 1 1 


DZD / 


o 
Z 


0 

o 


«J 


5 


5 


25 


\>f AAAA/I 1^1 FVQ AO 

MUUUU4 131 U.tSUo 


1 AO/77 
10V / / 


4 


A 
VI 


0 


o 


o 


0 


x/TAAAA/i i zir*- a no 
MUUUU4 1 0 /CAUy 


04 J D 




1 

1 


/; 

v7 


o 


o 


0 


AAAA/t 1 /COP'P 1 O 

MUUUU4 1 oyc 1 z 


^1 1 Q 


O 


o 
z 


o 


2 


2 


3 


\>i aaaa/1 it i rvum 

JV1UUUU4 1 / 1 U.dUj 


4VUo 


o 


7 


O 


2 


2 


0 


Kvi nnnA/i i Tor^«rinR 

IV1UUUU41 /ZL.UU5 


1 I4V4 


*f 


n 


o 


o 


o 


0 


IV1UUUU4 1 5 j\s.D\) / 


1A1O0 


*l 




o 

V/ 


o 


o 


0 


X AAAAAyl 1 OCP.AAO 

M000U4 1 oDC:CU3 


1 1 A A1 
1 1443 




1 
1 


A 

u 


A 

\j 


n 

\J 


o 


M00004197D:H01 


8210 


2 


6 


0 


0 


0 


0 


M00004203B:C12 


14311 


4 


0 


0 


0 


1 


2 


M00004212B:C07 


2379 


26 


13 


4 


2 


2 


3 


M00004214C:H05 


11451 


3 


2 


1 


2 


1 


1 


M00004223A:G10 


16918 


4 


0 


0 


0 


0 


0 


M00004223B:D09 


7899 


5 


4 


0 


2 


1 


0 


M00004223D:E04 


12971 


4 


0 


0 


0 


1 


0 
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Table 5 All Differential Data for Libs 1-4 and 8-9 



Clone Name 


Cluster 


Clones in 


Clones 




ID 


Libl 


Lib2 


M00004229B:F08 


6455 


3 


1 


M0000423 OB :C07 


7212 


3 


5 


M00004269D:D06 


4905 


7 


6 


X /AAA /\ A ^% f* /<i /"i - 4 

M00G04275C:C1 1 


16914 


3 


0 


M00004283B:A04 


14286 


3 


1 


M00004285B:E08 


56020 


1 


0 


MUUUU4Z95D:r 12 


16921 


4 


0 


M00004296C.H07 


13046 


4 


1 


M00004307C.A06 


9457 


2 


0 


M00004312A:G03 


26295 


2 


0 


M00004318C.D10 


21847 


2 


1 


M00004372A:A03 


2030 


13 


10 


M00004377C:F05 


2102 


12 


20 
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Clones in 


Clones in 


Clones in 


Clones 


Lib3 


Lib4 


Lib8 


Lib9 


6 


0 


0 


0 


2 


1 


3 


0 


3 


1 


3 


1 


0 


1 


0 


0 


0 


1 


1 


1 


0 


0 


0 


0 


0 


1 


2 


1 


0 


1 


0 


0 


5 


0 


3 


0 


0 


0 


0 


0 


0 


0 


0 


0 


32 


4 


0 


0 


23 


21 


6 


5 
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Table 6 All Differential Data for Libs 15-20 



Clone Name Cluster Clones in 
ID LiblS 



MOOOO 1340B:AOo 


1 fUOZ 


n 


M00001340D:F10 


1 I joy 


n 

u 


M00001341A:E12 


AAA1 
444J 


n 

u 


M00001342B:E06 


39oU5 


U 


M00001343C:F10 




A 
U 


M00001343D:H07 


232!) 3 


A 
U 


MOOOO 1 345 A.E01 


o42U 


A 


M00001346A:F09 
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U 


M00001346D:E03 


6806 


A 

u 


M00001346D:G06 


5779 


A 
U 


M00001346D:G06 


5779 


A 
0 


M00001347A:B10 


13576 


0 


M00001348B:B04 


16927 


A 


M00001348B:G06 


16985 


A 

u 


M00001349B:B08 


3584 


A 

0 


M00001350A:H01 


7187 


A 

0 


M00001351B:A08 


3162 


0 


MO0001351B:A08 


3162 


0 


M00001352A:E02 


16245 


0 


M00001353A:G12 


8078 


0 


M00001353D:D10 


14929 


A 

0 


M00001355B:G10 


14391 


a 

0 


M00001357D:D11 


4059 


rv 

0 


M00001361A:A05 


4141 


A 

u 


M00001361D:F08 


2379 


A 


M00001362B:D10 


5622 


A 

0 


M00001362C:H11 


945 


A 

0 


M00001365C:C10 


40132 


A 


M00001370A:C09 


6867 


ft 
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7172 


ft 

u 


M00001376B:G06 
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A 

U 
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A 

U 
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u 
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U 
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0 
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0 
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0 
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1 


M00001395A:C03 
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0 


M0000l396A:C03 
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0 


MOOOO 1 402 A:E08 


39563 


0 


M0000l407B:Dll 


5556 


0 


M0000l409C:Dl2 


9577 


0 



Clones in Clones in Clones in Clones in Clones in 



LflDlOD 
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U 
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0 
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0 
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0 
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0 
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Table 6 All Differential Data for Libs 15-20 


Clone Name 


Cluster 


Clones i] 




ID 


LiblS 


MOOOO 1 4 1 OA :D07 


7005 


0 


X A AAAA l/lii i~» i /\ 

MOOOO 1 4 1 2B :B 1 0 


8551 


0 


XvfAAAAl >i i r A i 1 r\ s' 

M0000 1 4 1 5 A :H06 


13538 


0 


X/fAAAAl/IIZTA iiAt 

M00001416A:H01 


7674 


0 


X Vf AAAA 1 A 1 z"r"» Tilt 

MOOOO 1 41 6B:H1 1 


8847 


0 


X vf A AAA 1 /( 1 n a t~»/\i 

MOOOO 1 4 1 7A:E02 


36393 


0 


MOOOO 1 4 1 8B:F03 


9952 


0 


X A AAAA 1 A 1 orv T*»r\y* 

MOOOO 1 4 1 8D:B06 


8526 


0 


k jIAAAA 1 A^ 1 S*^ T^/\ -1 

MOOOO 1 42 1C:F01 


9577 


0 


MOOOO 1423B:E07 


15066 


0 


k 4AAAA 1 iA i r* An 

MO000 1 424B :G09 


10470 


0 


MOOOO 1 425 B :H08 


22195 


0 


MOOOO 1 426D:C08 


4261 


0 


MOOOO 1428A:H 10 


84182 


0 


iv/innnri i aoq a «un/i 


±* 1 S 1 


A 

w 


MOOOO 1429B:A 11 


4635 


0 


MOOOO 1429D:D07 


40392 


0 


MOOOO 1439C:F08 


40054 


0 


MOOOO 1442C:D07 


16731 


0 


MOOOO 1 445 A:F05 


13532 


0 


MOOOO 1446A:F05 


7801 


0 


MOOOO 1447 A :G03 


10717 


0 


MOOOO 1448D:C09 


8 


1 


MOOOO 1448D:H01 


36313 


0 


MOOOO 1449A:A 12 


5857 


0 


MOOOO 1449A:B 12 


41633 


0 


MOOOO 1449A:D 12 


3681 


0 


MOOOO 1449A:G 30 


36535 


0 


MOOOO 1 449C:D06 


86110 


0 


MOOOO 1 450A:A02 


39304 


0 


MOOOO 1450A:A 11 


32663 


0 


MOOOO 1450A:B 12 


82498 


0 


\ 4AAAA 1 A r f\ A /\ n 

MOOOO 1 450A:D08 


27250 


0 


MOOOO 1 452 A:B04 


84328 


0 


X A AAAA 1 A C I A !*■» -t 

MOOOO 1 452 A :B 1 2 


86859 


0 


X >TA A A A t A C*k a r\A r» 

MOOOO 1 452A:D08 


1120 


0 


XifAAAAl A r f\ a r\ r 

MOOOO 1 452 A :F05 . 


85064 


0 


X >f AAAA 1 A C\fs r%r\s~ 

MOOOO 1452C:B06 


16970 


0 


X yf A A A A 1 AO A t 

MOOOO 1453 A :E1 1 


16130 


0 


X A AAAA 1 A r~>/"\ s~ 

M00001453C:F06 


16653 


0 


X iff A AAA 1 A C A A a f\r\ 

MOOOO 1 454A:A09 


83103 
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0 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 




ID 


Lib 15 


M00001461A:D06 


1531 


0 


M00001463C:B11 


19 


2 


M00001465A:B11 


10145 


0 


M00001466A:E07 


4275 


0 


M00001467A:B07 


38759 


0 


M00001467A:D04 


39508 


0 


M00001467A:D08 


16283 


0 


M00001467A:D08 


16283 


0 


M00001467A:E10 


39442 


0 


M00001468A:F05 


7589 


0 


M00001469A:C10 


12081 


0 


M00001469A:H12 


19105 


0 


M00001470A:B10 


1037 


0 


M00001470A:C04 


39425 


0 


M00001471A:B01 


39478 


0 


M00001481D:A05 


7985 


0 


M00001490B:C04 


18699 


0 


M00001494D:F06 


7206 


0 


M00001497A:G02 


2623 


0 


M00001499B:A11 


10539 


0 


M00001500A:C05 


5336 


0 


M00001500A:E11 


2623 


0 


M00001500C:E04 


9443 


0 


M00001501D:C02 


9685 


0 


M00001504C:A07 


10185 


0 


M00001504C:H06 


6974 


0 


M00001504D:G06 


6420 


0 


M00001507A:H05 


39168 


0 


M00001511A:H06 


39412 


0 


M00001512A:A09 


39186 


0 


M00001512D:G09 


3956 


0 


M00001513A:B06 


4568 


0 


M00001513C:E08 


14364 


0 


M00001514C:D11 


40044 


0 


M00001517A:B07 


4313 


0 


M00001518C:B11 


8952 


0 


M00001528A:C04 


7337 


0 


M00001528A:F09 


18957 


0 


M00001528B:H04 


8358 


0 


M00001531A:D01 


38085 


0 


M00001532B:A06 


3990 


1 


M00001533A:C11 


2428 


0 


M00001534A:C04 


16921 


0 


M00001534A:D09 


5097 


0 


M00001534A:F09 


5321 


0 


M00001534C:A01 


4119 


0 


M00001535A:B01 


7665 


0 
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0 
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39423 


0 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 




ID 


Libl5 


M00001536A:B07 


2696 


0 


M00001536A:C08 


39392 


0 


M00001537A:F12 


39420 


0 


M00001537B:G07 


3389 


0 


M00001540A:D06 


8286 


0 


M00001541A:D02 


3765 


0 


M00001541A:F07 


22085 


0 


M00001541A:H03 


39174 


0 


M00001542A:A09 


22113 


0 


M00001542A:E06 


39453 


0 


M00001544A:E03 


12170 


0 


M00001544A:G02 


19829 


0 


M00001544B:B07 


6974 


0 


M00001545A:C03 


19255 


0 




13354 


q 


M00001546A:G11 


1267 


1 


M00001548A:E10 


5892 


0 


M00001548A:H09 


1058 


0 


M00001549A:B02 


4015 


0 


M00001549A:D08 


10944 


0 


M00001549B:F06 


4193 


0 


M00001549C:E06 


16347 


0 


M00001550A:A03 


7239 


0 


M00001550A:G01 


5175 


0 


M00001551A:B10 


6268 


0 


M00001551A:F05 


39180 


0 


M00001551A:G06 


22390 


0 


M00001551C:G09 


3266 


0 


M00001552A:B12 


307 


0 


M00001552A:D11 


39458 


0 


M00001552B:D04 


5708 


0 


M00001553A:H06 


8298 


0 


M00001553B:F12 


4573 


0 


M00001553D:D10 


22814 


0 


M00001555A.B02 


39539 


0 


M00001555A:C01 


39195 


0 


M00001555D:G10 


4561 


0 


M00001556A:C09 


9244 


0 


M00001556A.F1 1 


1577 


0 


M00001556A:H01 


15855 


3 


M00001556B:C08 


4386 


1 


M00001556B:G02 


11294 


0 


M00001557A:D02 


7065 


0 


M00001557A:D02 


7065 


0 


M00001557A:F01 


9635 


0 


M00001557A:F03 


39490 


0 


M00001557B:H10 


5192 


0 


M00001557D:D09 


8761 


0 


M00001558B:H11 


7514 


0 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 




ID 


LiblS 


M0OO0 1 560D:F 1 0 


ODD© 


U 


M0000 1 56 1 A :C05 


Id A OA 


a 
u 


M00001 563B:r 06 


1 AO 




m 1AAAA 1 C f A A . T*^ 1 O 

M0000 1 564A :B 1 2 


50D.5 


A 

u 


M00001 57 1 C:H06 




A 

u 


M0000 1 578B:E04 


230UI 


A 

u 


M0000 1 579D:C03 


6539 


A 

u 


MOOOOl583D:AlO 


6293 


A 

u 


M0000l586C:C05 


4623 


A 

u 


M0000l587A:Bll 


"> A^ OA 

39380 


A 
0 


M0000l594B:H04 


O ZT A 

260 


A 
0 


M00001597C:H02 


4837 


A 
U 


M0000l597D:C05 


10470 


0 


M0000l598A:G03 


16999 


1 

1 


M0000l60lA:D08 


22794 


0 


M0O00l6O4A:Bl0 


1399 


0 


M00001604A:F05 


39391 


0 


M0O00l607A:Ell 


11465 


0 


M0000l608A:B03 


7802 


0 


M0000l608B:E03 


22155 


0 


M00001614C:F10 


13157 


0 


M0000l6l7C:E02 


17004 


0 


M0O00l6l9C:Fl2 


40314 


0 


M0000l62lC:C08 


40044 


0 


M0O00l623D:FlO 


13913 


0 


M0O00l624A:B06 


3277 


0 


M0OOOl624C:F0l 


4309 


A 

0 


M0000l630B:H09 


5214 


1 


M0000l644C:B07 


39171 


0 


M0000l645A:Cl2 


19267 


A. 

0 


M0000l648C:A0l 


4665 


0 


M0000l657D:C03 


2320 1 
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23218 
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u 
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6660 


0 


M0000l679A:Fl0 


26875 


0 


M0O00l679B:F0l 


6298 


0 


M0000l679C:F0l 


78091 


0 


M00001679D:D03 
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0 
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10751 


0 
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Table 6 AH Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 




ID 


LiblS 


M00001680D:F08 


10539 


0 


M00001682C:B12 


17055 


0 


M00001686A:E06 


4622 


0 


M00001688C.F09 


5382 


0 


M00001693C:G01 


4393 


0 


M00001716D:H05 


67252 


0 


M00003741D.C09 


40108 


0 


M00003747D:C05 


11476 


0 


M00003759B:B09 


697 


0 


M00003762C:B08 


17076 


0 


M00003763A:F06 


3108 


0 


M00003774C:A03 


67907 


0 


M00003796C:D05 


5619 


0 


M00003826B:A06 


11350 


0 


MQQQ03 833 A:E05 


21877 


n 


M00003837D:A01 


7899 


0 


M00003839A:D08 


7798 


0 


M00003844C:B11 


6539 


0 


M00003846B:D06 


6874 


0 


M00003851B:D10 


13595 


0 


M00003853A:D04 


5619 


0 


M00003853A:F12 


10515 


0 


MO0O03856B:CO2 


4622 


0 


M00003857A:G10 


3389 


0 


M00003857A:H03 


4718 


0 


M00003871C:E02 


4573 


0 


M00003875B:F04 


12977 


0 


M00003875B:F04 


12977 


0 


M00003875C:G07 


8479 


0 


M00003876D:E12 


7798 


0 


M00003879B:C11 


5345 


0 


M00003879B:D10 


31587 


0 


M00003879D:A02 


14507 


0 


M00003885C:A02 


13576 


0 


M00003885C.A02 


13576 


0 


M00003906C:E10 


9285 


0 


M00003907D:A09 


39809 


0 


M00003907D-.H04 


16317 


0 


M00003909D:C03 


8672 


.0 


M00003912B:D01 


12532 


0 


M00003914C:F05 


3900 


0 


M00003922A:E06 


23255 


0 


M00003958A:H02 


18957 


0 


M00003958A:H02 


18957 


0 


M00003958C:G10 


40455 


0 


M00003958C:G10 


40455 


0 


M00003968B:F06 


24488 


0 


M00003970C:B09 


40122 


0 


M00003974D:E07 


23210 


0 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 




ID 


LiblS 


M00003974D:H02 


23358 


0 


M00003975A:G11 


12439 


0 


M00003978B:G05 


5693 


0 


M00003981A:E10 


3430 


0 


M00003982C:C02 


2433 


0 


M00003983A:A05 


9105 


0 


M00004028D:A06 


6124 


0 


M00004028D:C05 


40073 


0 


M00004031A:A12 


9061 


0 


M00004031A:A12 


9061 


0 


M00004035C:A07 


37285 


0 


M00004035D:B06 


17036 


0 


M00004059A:D06 


5417 


0 


M00004068B:A01 


3706 


0 


M00004072B:B05 


17036 


0 


M00004081C:D10 


15069 


0 


M00004081C:D12 


14391 


0 


M00004086D:G06 


9285 


0 


M00004087D:A01 


6880 


0 


M00004093D:B12 


5325 


1 


M00004093D:B12 


5325 


1 


M00004105C:A04 


7221 


0 


M00004108A:E06 


4937 


0 


M00004111D:A08 


6874 


0 


M00004114C:F11 


13183 


0 


M00004138B:H02 


13272 


0 


M00004146C:C11 


5257 


0 


M00004151D:B08 


16977 


0 


M00004157C:A09 


6455 


0 


M00004169C:C12 


5319 


0 


M00004171D:B03 


4908 


0 


M00004172C:D08 


11494 


0 


M00004183C:D07 


16392 


0 


M00004185C:C03 


11443 


0 


M00004197D:H01 


8210 


0 


M00004203B:C12 


14311 


0 


M00004212B:C07 


2379 


0 


M00004214C:H05 


11451 


0 


M00004223A:G10 


16918 


0 


M00004223B:D09 


7899 


0 


M00004223D:E04 


12971 


0 


M00004229B:F08 


6455 


0 


M00004230B:C07 


7212 


0 


M00004269D:D06 


4905 


0 


M00004275C:C11 


16914 


0 


M00004283B:A04 


14286 


0 


M00004285B:E08 


56020 


0 


M00004295D:F12 


16921 


0 


M00004296C:H07 


13046 
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Table 6 All Differential Data for Libs 15-20 



Clone Name 


Cluster 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 


Clones in 




ID 


LiiblS 


Lib lob 


Libl7 


Libl8 


Libl9 


Lib20 


M00004307C:A06 


9457 


0 


0 


0 


0 


0 


0 


M00004312A:G03 


26295 


0 


0 


0 


0 


0 


0 


M00004318C:D10 


21847 


0 


0 


0 


0 


0 


0 


M00004372A:A03 


2030 


0 


0 


0 


0 


0 


0 


M00004377C:F05 


2102 


0 


0 


0 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



PCT/US98/27610 



Clone Name 


Cluster jlu 


M0000 1 340B : A06 


1 /uoz 
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M00001341A:L12 
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J70UJ 
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Table 7 All Differential Data for Libs 12-14 



Clone Name 



Cluster ID 



Clones in 
Libl2 



M00001409C:D12 


9577 


0 


M00001410A.-D07 


7005 


0 


M00001412B:B10 


8551 


0 


M00001415A.H06 


13538 


0 


M00001416A:H01 


7674 


0 


M00001416B:H11 


8847 


1 


M00001417A:E02 


36393 


0 


M00001418B:F03 


9952 


0 


M00001418D.-B06 


8526 


0 


M00001421C:F01 


9577 


0 


M00001423B:E07 


15066 


0 


M00001424B:G09 


10470 


0 


M00001425B:H08 


22195 


0 
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4261 
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0 
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4635 


0 


M00001429D:D07 
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0 
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M00001449A:G10 


36535 


0 


M00001449C:D06 
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82498 


0 


M00001450A:D08 


27250 


0 


M00001452A:B04 


84328 


0 


M00001452A:B12 


86859 


0 


M00001452A:D08 


1120 


0 


M00001452A:F05 


85064 


0 


M00001452C.B06 


16970 


1 


M00001453A:E1 1 


16130 


0 


M00001453C:F06 


16653 


0 


M00001454A:A09 


83103 


0 


M00001454B:CI2 


7005 


0 


M00001454D:G03 


689 


0 


M00001455A:E09 


13238 


0 


M00001455B:E12 


13072 


0 


M00001455D:F09 


9283 


0 


M00001455D.F09 


9283 


0 



Clones in 
Libl3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 
6 
0 
0 
0 
0 
0 
0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



PCT/US98/27610 



Clones in 
Libl4 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

9 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 
0 
0 
0 



306 



BNSOOCID: <WO 9933982A2_I_> 



WO 99/33982 

Table 



7 All Differential Data for Libs 



12-14 



PCT/US98/27610 



Clone Name 


Cluster ID 


M00001460A:F06 


2448 


M00001460A:F12 


39498 


M00001461A:D06 


1531 


M00001463C:B11 


19 


M00001465A:B11 


10145 


M00001466A:E07 


4275 


M00001467A:B07 


38759 


M00001467A:D04 


39508 


M00001467A:D08 


16283 


M00001467A:D08 


16283 


M00001467A:E10 


39442 


M00001468A:F05 


7589 


M00001469A:C10 


12081 


M00001469A:H12 


19105 


M00001470A:B10 


1037 


M00001470A:C04 


39425 


M00001471A:B01 


39478 


M00001481D;A05 


7985 


M00001490B:C04 


18699 


M00001494D:F06 


7206 


M00001497A:G02 


2623 


M00001499B:A11 


10539 


M00001500A.C05 


5336 


M00001500A:E11 


2623 


M00001500C:E04 


9443 


M00001501D:C02 


9685 


M00001504C:A07 


10185 


M00001504C:H06 


6974 


M00001504D:G06 


6420 


M00001507A:H05 


39168 


M00001511A:H06 


39412 


M00001512A:A09 


39186 


M00001512D:G09 


3956 


M00001513A:B06 


4568 


M00001513C:E08 


14364 


M00001514C:D11 


40044 


M00001517A:B07 


4313 


M00001518C:B11 


8952 


M00001528A:C04 


7337 
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M00001534C:A01 
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Table 7 All Differential Data for Libs 12-14 



Clone Name 


Cluster ID 


Clones in 


Clones in 


Clones in 






Libl2 


Libl3 


Libl4 


M00001535A:B01 


7665 


0 


2 


4 


M00001535A:C06 


20212 


0 


0 


0 


M00001535A:F10 


39423 


0 


0 


0 


M00001536A:B07 


2696 


0 


0 


0 


M00001536A:C08 


39392 


0 


0 


0 


M00001537A:F12 


39420 


0 


0 


0 


M00001537B:G07 


3389 


0 


0 


0 


M00001540A:D06 


8286 


0 


0 


0 


MUOUO 1 54 1 A:D02 


3765 


0 


0 


0 


M00001541A:F07 


22085 


0 


0 


0 


M00001541A:H03 


39174 


0 


0 


0 


M00001542A:A09 


22113 
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0 


M00001542A:E06 


39453 
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0 


M00001544A:E03 
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0 
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M00001545A.C03 


19255 
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0 


M00001545A:D08 


13864 


0 


0 


0 


M00001546A:G11 


1267 


0 


0 


0 


M00001548A:E10 


5892 


0 


1 


0 


M00001548A:H09 


1058 


1 


3 


0 


M00001549A:B02 


4015 


0 


1 


0 


M00001549A:D08 


10944 


1 


0 


0 


M00001549B:F06 


4193 


0 


0 


0 


M00001549C:E06 


16347 


0 


0 


0 


M00001550A:A03 


7239 


0 


1 


0 


M00001550A:G01 


5175 


1 


0 


0 


M00001551A.-B10 


6268 


0 


0 


1 


M00001551A:F05 


39180 


0 


0 


0 


M00001551A:G06 


22390 


0 


0 


1 


M00001551C:G09 


3266 


0 


0 


0 


M00001552A:B12 


307 


6 


11 


4 


M00001552A:D1 1 


39458 


0 


0 


0 


M00001552B:D04 


5708 


0 


0 


0 


M00001553A:H06 


8298 


0 


0 


0 


M00001553B:F12 


4573 


0 


0 


0 


M00001553D:D10 


22814 


0 


0 


0 


M00001555A:B02 


39539 


0 


0 


0 


M00001555A.C01 


39195 


0 


0 


0 


M00001555D:G10 


4561 


0 


0 


0 


M00001556A:C09 


9244 


0 


1 


0 


M00001556A:F11 


1577 


0 
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15855 
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0 
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M00001556B:G02 


11294 
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0 


M00001557A:D02 


7065 
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0 


M00001557A:D02 


7065 
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0 


M00001557A:F01 
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0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



Clone Name 

M00001557A:F03 

M00001557B:H10 

M00001557D:D09 

M00001558B:H11 

M00001560D:F10 

M00001561A:C05 

M00001563B:F06 

M00001564A:B12 

M00001571C:H06 

M00001578B:E04 

MO00O1579D:C03 

M00001583D:A10 

M00001586C:C05 

M00001587A:B11 

M00001594B:H04 

M00001597C:H02 

M00001597D:C05 

M00001598A:G03 

M00001601A:D08 

M00001604A:B10 

M00001604A:F05 

M00001607A-.E11 

M00001608A:B03 

M00001608B:E03 

M00001614C:F10 

M00001617C:E02 

M00001619C:F12 

M00001621C:C08 

M00001623D:F10 

M00001624A:B06 

M00001624C:F01 

M00001630B:H09 

M00001644C:B07 

M00001645A:C12 

M00001648C-.A01 

M00001657D:C03 

M00001657D:F08 

M00001662C:A09 

M00001663A:E04 

M00001669B:F02 

M00001670C:H02 

M00001673C-.H02 

M00001675A:C09 

M00001676B:F05 

M00001677C:E10 

M00001677D:A07 

M00001678D:F12 

M00001679A:A06 



Cluster ID 

39490 
5192 
8761 
7514 
6558 
39486 
102 
5053 
5749 
23001 
6539 
6293 
4623 
39380 
260 
4837 
10470 
16999 
22794 
1399 
39391 
11465 
7802 
22155 
13157 
17004 
40314 
40044 
13913 
3277 
4309 
5214 
39171 
19267 
4665 
23201 
76760 
23218 
35702 
6468 
14367 
7015 
8773 
11460 
14627 
7570 
4416 
6660 



Clones in 
Libl2 

0 
0 
0 
0 
0 
0 
2 
0 
0 
0 
0 
0 
0 
0 

1 
1 

0 

4 

0 

6 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

2 

0 

0 

1 

0 



Clones in 
Libl3 

0 
0 
0 
0 
0 
0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

2 

0 

3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

2 

0 



PCT/US98/27610 



Clones in 
Libl4 

0 

0 

0 

0 

0 

0 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

6 

0 

3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 
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Table 7 All Differential Data for Libs 12-14 



Clone Name 

MOO00I679A:F10 

M00001679B:F01 

M00001679C:F01 

M00001679D:D03 

M00001679D.-D03 

M00001680D:F08 

M00001682C.B12 

M00001686A:E06 

M00001688C:F09 

M00001693C:G01 

M00001716D.H05 

M00003741D:C09 

M00003747D:C05 

M00003759B:B09 

ivivui/uj /U6\^:dV5 

M00003763A:F06 

M00003774C:A03 

M00003796C:D05 

M00003826B:A06 

M00003833A:E05 

M00003837D:A01 

M00003839A:D08 

M00003844C:B11 

M00003846B.D06 

M00003851B:D10 

M00003853A:D04 

M00003853A:F12 

M00003856B.C02 

M00003857A:G10 

M00003857A:H03 

M00003871C:E02 

M00003875B:F04 

M00003875B.F04 

M00003875C:G07 

M00003876D:E12 

M00003879B:C11 

M00003879B:D10 

M00003879D:A02 

M00003885C:A02 

M00003885C:A02 

M00003906C:E10 

M00003907D:A09 

M00003907D:H04 

M00003909D:C03 

M00003912B.-D01 

M00003914C:F05 

M00003922A:E06 

M000O3958A:HO2 



Cluster ID 

26875 
6298 
78091 
10751 
10751 
10539 
17055 
4622 
5382 
4393 
67252 
40108 
11476 
697 
17076 
3108 
67907 
5619 
11350 
21877 
7899 
7798 
6539 
6874 
13595 
5619 
10515 
4622 
3389 
4718 
4573 
12977 
12977 
8479 
7798 
5345 
31587 
14507 
13576 
13576 
9285 
39809 
16317 
8672 
12532 
3900 
23255 
18957 



Clones in 
Libl2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

4 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 



Clones in 
Libl3 

0 
0 
0 
0 
0 

1 

0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 

1 

0 
0 
0 
0 
0 
0 
0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

8 
0 
0 
0 
0 
0 
0 
0 
0 
0 

1 

0 
0 



PCT7US98/27610 



Clones in 
Libl4 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 
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Table 7 All Differential Data for Libs 12-14 



PCT/US98/27610 



Clone Name 


Cluster ID 


M00003958A:H02 


1895 / 


M00003958C:G10 


4U450 


M00003958C:G10 


40455 


M00003968B:F06 


244© o 


M00003970C:B09 


A A 1 ""JO 

40122 


M00003974D:E07 


23210 


M00003974D:H02 


23358 


M00003975A:G11 


12439 


M00003978B:G05 


5693 


M00003981A:E10 


3430 


M00003982C:C02 


2433 


M00003983A:A05 


9105 


M00004028D:A06 


6124 


M00004028D:C05 


40073 


M00004031A:A12 


9061 


M00004031A:A12 


r\r\ f 1 

906 1 


M00004035C:A07 


37285 


M00004035D:B06 


17036 


M00004059A:D06 


5417 


M00004068B:A01 


3706 


M00004072B:B05 


17036 


M00004081C:D10 


15069 


M00004081C:D12 


14391 


M00004086D:G06 


9285 


M00004087D:A01 


6880 


M00004093D:B12 


5325 


M00004093D:B12 


5325 


M00004105C:A04 


7221 


M00004108A:E06 


4937 


M00004111D:A08 


/AH A 

6874 


M00004114C:F11 


13183 


M00004138B:H02 


13272 


M00004146C:C11 


5257 


M00004151D:B08 


1 /C AT7 

16977 


M00004157C:A09 


6455 


M00004169C:C12 




M00004171D:B03 


4^0 o 


M00004172C:D08 


1 1 /I CM 

1 14V4 


M00004183C:D07 




M00004 1 85C:C03 


1 1 A At 

1 1443 


M00004 1 97D:H0 1 


1 A 


M00004203B:C12 


1 A 1 1 1 

143 1 1 


M00004212B:C07 


2379 


M00004214C:H05 


11451 


M00004223A:G10 


16918 


M00004223B:D09 


7899 


M00004223D:E04 


12971 


M00004229B:F08 


6455 



lones in Clones in Clones in 
Libl2 Libl3 Libl4 



ft 


o 


0 




o 


0 




o 


0 


o 


o 


0 


o 


0 


0 


n 

u 


o 


0 


A 

u 


o 


o 


ft 


o 


0 


o 

w 


o 


0 


A 
\J 


o 


o 


Z 


A 


o 


A 
U 


u 


o 


A 
U 




o 


A 
U 


1 
l 


o 


A 
U 


A 
U 


o 


A 
0 


A 
U 


o 


A 
0 


A 
U 


o 


A 
0 


A 
U 


o 


A 
0 


A 
U 


o 


0 


A 

u 


o 


A 

U 


A 
U 


o 


A 

u 


A 
U 


o 


A 


u 


o 


A 
U 


A 

u 


o 


A 
U 


A 
1/ 


o 


A 

u 


A 

V 


o 


A 
U 


A 
V 


o 


A 
U 


0 


o 


A 
U 




o 


A 
U 


n 


o 


A 
U 


o 


o 


A 
U 


o 


0 


A 

u 


o 


1 


n 


o 


0 


n 

V 


o 


0 


n 

u 


o 


0 


o 


o 


0 


o 


o 


0 


o 

VF 


o 


0 


7 
*i 


o 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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Table 7 All Differential Data for Libs 12-14 



Clone Name 



Cluster ID 



Clones in 
Libl2 



Clones in 
Libl3 



PCT/US98/27610 



Clones in 
Libl4 



M00004230B:C07 


7212 


0 


0 


1 


M00004269D:D06 


4905 


0 


0 


0 


M00004275C:C1 1 


16914 


0 


0 


0 


M00004283B:A04 


14286 


0 


0 


0 


iVi IfUUUHi O J X5 . £1U 0 




0 


0 


0 


M00004295D:F12 


16921 


0 


0 


0 


M00004296C:H07 


13046 


0 


0 


0 


M00004307C:A06 


9457 


1 


0 


0 


M00004312A:G03 


26295 


0 


0 


0 


M00004318C:D10 


21847 


0 


0 


0 


M00004372A:A03 


2030 


0 


0 


0 


M00004377C:F05 


2102 


0 


0 


0 
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1. A library of polynucleotides, the library comprising the sequence information of 
at least one of SEQ ID NOS: 1 -844. 

5 

2. The library of claim 1 , wherein the library is provided on a nucleic acid array. 

3. The library of claim 1, wherein the library is provided in a computer-readable 

format. 

10 

4. The library of claim 1, wherein the library comprises a differentially expressed 
polynucleotide comprising a sequence selected from the group consisting of SEQ ID NOS:9, 
39, 42, 52, 62, 74, 1 19, 172, 317, and 379. 

15 5. The library of claim 1 , wherein the library comprises a polynucleotide 

differentially expressed in a human breast cancer cell, where the polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 
74,81, 114, 123, 144, 130, 157, 162, 172, 178, 183,202,214,219,223,258,298,317, 338, 
379, 384, 386, and 388. 

20 

6. The library of claim 1, wherein the library comprises a polynucleotide 
differentially expressed in a human colon cancer cell, where the polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NOS: 1, 39, 52, 97, 119, 134, 172, 
176, 241, 288, 317, 357, 362, and 374. 

25 

7. The library of claim 1 , wherein the library comprises a polynucleotide 
differentially expressed in a human lung cancer cell, where the polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 1 19, 
135, 154, 160, 260, 308, 323, 349, 361, 369, 371, 379, 395, 381, and 400. 

30 

8. An isolated polynucleotide comprising a nucleotide sequence having at least 90% 
sequence identity to an identifying sequence of SEQ ID NOS: 1 -844 or a degenerate variant 
thereof. 

313 
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9. An isolated polynucleotide according to claim 8, wherein the polynucleotide 
comprises a seqeuence encoding a polypeptide of a protein family selected from the group 
consisting of: 4 transmembrane segments integral membrane proteins, 7 transmembrane 
receptors, ATPases associated with various cellular activities (AAA), eukaryotic aspartyl 
proteases, GATA family of transcription factors, G-protein alpha subunit, phorbol 
esters/diacylglycerol binding proteins, protein kinase, protein phosphatase 2C, protein 
tyrosine phosphatase, trypsin, wnt family of developmental signaling proteins, and 
WW/rsp5/WWP domain containing proteins. 

1 0. The polynucleotide of claim 9, wherein the polynucleotide comprises a 
sequence of one of SBQ ID NOS: 24, 41, 101, 157, 291, 305, 315, 341, 63, 1 16, 134, 136, 
151, 384, 404, 308, 213, 367, 188, 251, 202, 315, 367, 397, 256, 382, 169, 23, 291, 324, 330, 
341,353, 188, 379, and 395. 

1 1 . The polynucleotide of claim 8, wherein the polynucleotide comprises a 
seqeuence encoding a polypeptide having a functional domain selected from the group 
consisting of: Ank repeat, basic region plus leucine zipper transcription factors, 
bromodomain, EF-hand, SH3 domain, WD domain/G-beta repeats, zinc finger (C2H2 type), 
zinc finger (CCHC class), and zinc-binding metalloprotease domain. 

12. The polynucleotide of claim 11, wherein the polynucleotide comprises a 
sequence ofoneofSEQ ID NOS: 116,251,374,97, 136,242,379,306,386, 18,335,61, 
306, 386, 322, 306, and 395. 

13. A recombinant host cell containing the polynucleotide of claim 8. 

14. An isolated polypeptide encoded by the polynucleotide of claim 8. 

15. An antibody that specifically binds a polypeptide of claim 14. 

16. A vector comprising the polynucleotide of claim 8. 
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17. A polynucleotide comprising the nucleotide sequence of an insert contained in a 
clone deposited as ATCC accession number xx, xx, xx, xx, xx, xx, xx, xx, or xx. 

18. A method of detecting differentially expressed genes correlated with a cancerous 
5 state of a mammalian cell, the method comprising the step of: 

detecting at least one differentially expressed gene product in a test sample derived 
from a cell suspected of being cancerous, where the gene product is encoded by a gene 
corresponding to a sequence of at least one of SEQ ID NOS:4 5 9, 39, 42, 52, 62, 65, 66, 68, 
74,81, 114, 123, 144, 130, 157, 162, 172, 178, 183,202,214,219,223,258, 298,317,338, 
10 379, 384, 386, 388, 1, 39, 52, 97, 1 19, 134, 172, 176, 241, 288, 317, 357, 362, 374, 9, 34, 42, 
62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 361, 369, 371, 379, 395, 381, and 400; 

wherein detection of the differentially expressed gene product is correlated with a 
cancerous state of the cell from which the test sample was derived. 

15 19. The method of claim 18, wherein said detecting step is by hybridization of the 

test sample to a reference array, wherein the reference array comprises an identifying 
sequence of at least one of SEQ ID NOS:l-844. 

20. The method of claim 18, wherein the cell is a breast tissue derived cell, and the 
20 differentially expressed gene product is encoded by a gene corresponding to a sequence of at 

least one of SEQ ID NOS: 4, 9, 39, 42, 52, 62, 65, 66, 68, 74, 81, 1 14, 123, 144, 130, 157, 
162, 172, 178, 183, 202, 214, 219, 223, 258, 298, 317, 338, 379, 384, 386, and 388. 

21. The method of claim 18, wherein the cell is a colon tissue derived cell, and the 
25 differentially expressed gene product is encoded by a gene corresponding to a sequence of at 

least one of SEQ ID NOS: 1, 39, 52, 97, 119, 134, 172, 176, 241, 288, 317, 357, 362, and 
374. 

22. The method of claim 18, wherein the cell is a lung tissue derived cell, and the 
30 differentially expressed gene product is encoded by a gene corresponding to a sequence of at 

least one of SEQ ID NOS: 9, 34, 42, 62, 74, 106, 119, 135, 154, 160, 260, 308, 323, 349, 
361, 369, 371, 379, 395, 381, and 400. 
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SEQUENCE LISTING 

<110> Lewis T. Williams 

Jaime Escobedo 

Michael A. Innis 

Pablo Dominiguez Garcia 

Julie Sudduth-Klinger 

Christoph Reinhard 

Klause Giese 

Filippo Randazzo 

Giulia C. Kennedy 

David Pot 
Altaf Kassan 
George Lamson 
Radoje Drmanac 
Radomir Crkvenjakov 
Mark Dickson 
Snezana Drmanac 
Ivan Labat 
Dena Leshkowitz 
David Kita 
Veronica Garcia 
William Lee Jones 
Birjit Stache-Crain 

<120> Novel Human Genes and Gene Expression 
Products I 

<130> 2300-1480P 

<140> 09/ 

<141> 1998-12-21 

<150> 60/068,755 
<151> 1997-12-23 

<150> 60/080,664 
<151> 1998-04-03 

<150> 60/105,234 
<151> 1998-10-21 

<160> 844 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 300 

<212> DNA 

<213> Homo sapiens 

1 
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<210> 4 

<211> 287 

<212> DNA 

<213> Homo sapiens 



<400> 4 

aaancngcac gangccacgt ncgnnnngnt nntactnnnc natngccncn tcantggcng 
ncagctagac gcctaacagc cgangancca nccntnntgt ganccngtcn tgacngnnag 
cntgccggtc ttgctcnttt tgtctaccnn gagganannn ntntgggaca tcccagactg 
agtgaggaga tctgcngctg cnnctgtact tggttacanc ncacacgang actntncctt 
ggactanana cactagccta anattcngca ctacctantc ctctggc 



<210> 5 

<211> 300 

<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 



<400> 1 

tctcccctga gctgcaggcc tgcatatcca gtaggtctac tggacatctg tactggttgt 
tgnggaggaa cctctggctt gctcattaag tcctactgat tttcactatc ccctgaattt 
ccccacttat ttttgtcttt cactatcgca ggccttagaa gaggtctacc tgcctccagt 
cttacctagt ccagtctacc ccctggagtt agaatggcca tcctgaagtg aaaagtaatg 
tcacattact cccttcagtg atttcttgta gaagtgccaa tccctgaatg ccaccaagat 

<210> 2 
<211> 299 
<212> DNA 
<213> Homo sapiens 

<400> 2 

cccagctgct caggaggctg aggcaggaga attgcttgaa cccaagaggc ggaggttgtg 
gtgagccgag attgcacctt tgtactccag cctgggcaac gagcaaaaaa ctctgtctca 
aacaaaaaga agaaaaaaaa aaaaaaaann nnnnnnnnnn nnnnnnnnnn nnnttnttct 180 
ggcgncnagt cccaaantcn taccttgtaa gacctttann tnncctgngg tntttntnna 240 
cncttanata nnnntntttn ctatcaanta tagggagant tttcntttng gggcaactt 299 

<210> 3 

<211> 300 

<212> DNA 

<213> Homo sapiens 



60 
120 



<400> 3 

atacgattcg aacnnggaca agacgagtat ggaataatat cccactnnnt ttacaatact 
ganattatgc ngngatagng cttgttccat tcnaccagcg aatnatgcat tnacncnaca 
cnngagttac tatccaaaca cacgttttca cgntacctga ngctggtnga naattatgcg 
accatgaggc tttccangat ntttctannt ancagacngn gnacaatgnt gaanaagcng 24 0 

tacacaccgc nctngncrmc cnncactgan cangtnacnn ngctcactgn ngcctctttc 300 



60 
120 
180 



60 
120 
180 
240 
287 



60 



<400> 5 

gtccttttga accaccccaa agaactcaac atggcaaagc aaatggtaaa agcttcccga 
ctgttctact ttgggtccgc gcgaagccca ctcacgtgtg atctgtgttg cccctgggag 120 
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gcccggggcg accggaaaag ggctctctca agttctgaaa agagaatctg ccaccacatc 
gaatttcgac ccctgagctt gttcggacgt atggtccaaa t?cJg.tt2 ggtggtcacc 
caacccgaga tgtcaggaaa ggccttctgc agagaaaatg tccccccacc cgccatctgc 



<210> 6 

<211> 284 

<212> DNA 

<213> Homo sapiens 

<400> 6 



ccnSttn Illicit? tgCCCtnnC * c ^nngtcc nttaggcctt atcccatcgn 
ocntcgtttn gcattctgcc nnagantgac tttncnatca tgcntnatnn gtnnttacna 
ggggctnggg tgaattntta caccctgcna ntccatanca cantgccntg S^ctJcac 
cctcntgaat aaatgcaata aantttcngt tgatcttata cacc?tatgc nccntantta 
atcagccctn tnttacnana tcnanttatg cnggtattaa aaca nccnta »"* 



<210> 7 

<2ll> 277 

<212> DNA 

<213> Homo sapiens 

<400> 7 



STtgctgcaga caacacacct tcctgatgga ggtgtccggc tgatggagaa gtctgtcxaac 
S ° a tc "tgatgt taaccaggcc gacgctgtgg ccacStccg aaagattSc 
Ztll,? CCtaimnnnn nnnnnnnnnn nnnggatttg atnagcctgt nccanacctc 
anCggt " 3tn -taccatagt ggggatgacc ctctgata^t ttgncctggt 
ngancatgnt gacanntgct tctacagctt nngggac 9 33 



<210> 8 

<211> 292 

<212> DNA 

<213> Homo sapiens 



<400> 8 

cttgggaggc tgagtcagga gaattgcttg agcccaggag atggaggttg caataaacca 

:iit::ii:: ::ti:::r: agactggg ~ -c^is J^iiiti 22s 

aaata^^n Cattta9tat a ^99999t gggcgggaga aataatgtta tttcctatgc 
aaatgacgnn nnnnnnnnnn cccatggtaa atgtnaatat actgcgtctn ttttgaaana 
gccttttant aaangagtct tanatgaatc tctanntnat ganitLacJ ^ " 9 



<210> 9 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 9 

^rf 9 ^ a9C t9CtSfaatca aagcttcaaa cagaagttaa agaaggaaaa gaaacttcaa 
gcaaattgga aaaagaaact tgtaagaaat cacaccctat tctaKtgtg tcttctaaat 
tlcttalT gaCCCa3tgC «*°™*9t aaagactttt ctttaata^ agtacggtgc 
mllftT, 3aaa9ttact atggtgctta agattgtctt gatctgacat aLtcIcc^t 
ctgggttatt tactcattgt gccaggacct ggcattttca tgtgcctttg accaagtgtt 



180 
240 
300 



60 
120 
180 
240 
284 



60 
120 
180 
240 
277 



60 
120 
180 
240 
292 



60 
120 
180 
240 
300 
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<210> 10 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 10 

aggaggcgga gcttgcagtg agctgagatc gcgccactgc actccagcct gggcaacaga 
gtgagactct gtctcaaaaa aaaaaaaaaa nnnnnnnnnn nnnnnnnnaa nctcgtnttn 
gnaaggaaan ggggnaangg accggtntta tncctatgtn gtntttgcag gcaaangaaa 
nggacccntt tttgtaaaaa aaagtctttt gnncaantaa acggggtntg ngggtncagg 
ccctggnggg gcncncantt gcctggnggc ttntgnnaaa tcggnaaagg gaggaaaggc 



<210> 11 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 11 

cgtctgtaat cccagctgct tgggaggctg aggcaggaga atcacttgaa ccctggaggt 
ggcggtttca gtgagcacag atcatgccac tgcactccag cctgggcaac aaaacgagac 
ttcgtctcaa aaaaaaaaan nnnnnnnnnn nnnnnnnnnn nncgggttct cccaaattnt 
ttnnaggggn ccatggncaa ctgnttnacn tttgtttngg naaccccntg cccnaagncg 
cananaggct gtnnttnncc ttgttnccaa ggntgaggan caaaaagtac cctntgtttt 



<210> 12 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 12 



<4UU> x& 

caaagatggt cgtattacta aaggtgaata accagcgcgg ggggcacgtg gagtcactgg 
aacatttgtg caatgctggt gggaatgtca acccgtgcgg ccctctggaa taagcctggc 
agctcctcca agagttaccg tgtgacccag caattccact cctagctcca cccacaggaa 
ttgaaagcaa agacgcaaac agatgcctgt gcaccaaagt tcacggcagc atccttcgcc 
atagtggcag catccgtcgt cacagcggca tcatccttca tcatagcggc agcatccgtc 



<210> 13 

<211> 278 

<212> DNA 

<213> Homo sapiens 



<400> 13 

cctgcagcca ctaatgcatt gtgtatgata acaaaaactc tggtatgaca cattttctgt 
gatcattgtt aattagtgac atagtaacat ctgtagcagc tggttagtaa acctcatgtg 
ggggaggtgt gggaggtttt nncgnnannn nnngcnnnnn annccccggn nngnnngaag 
ctgnnnnttn naannngcnn nnnannatga naannnccnn ngactggnnn nangaggcct 
ancccntgnt ttananaaac nnncnncagn ntctctca 



<210> 14 
<211> 300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
278 
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<212> DNA 

<213> Homo sapiens 

<400> 14 



I™""* 1 Cttaccca ^t ggaacctaag aaattaaatt ctccagaaga aactgctttt 

ct^tttct 3 : ? a t:t? t T c r t9cc , tc " t ::ti:it? a 

tlfl^lt caaaattatc tacacctgat gttgtgagcc catttgggac cccatttciqc 
allll? T t9aatC ^ at STSrctggaatt tttgatgtaa acacctgcta tgggtcaccg 
caaagtcctc agctaataag aagggggcca agattgtgga catcagcttc 



<210> 15 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 15 



gttatattaa attattcttt gtttttcttt ttettttaat aaaacntao, a „„.^„ 

22S S— SS as ~= 5~ 
=K= S5SS - = -~ ~ 



<210> 16 

<211> 276 

<212> DNA 

<213> Homo sapiens 

<400> 16 



<210> 17 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 17 



ggtgcccatc accacaccca gctaactttt gtatttttag tagagacggg gtttcaccat 
gttggccagg ctggtcttga actcctgacc tcgtgatccg cccgcctfgg SccgcJaaa 
tgctgggatt acaagcatga gcccagcgcc tggctgtatc tttcatttta cccaScac 
actata^aa taagtaatta ***—gca tgagtcttgt accacctgtt c^SS 
actgtgggaa acggagccaa cggacctaag tgccctttga cagtgag?tt catacSSt 



<210> 18 

<211> 273 

<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 



120 
180 
240 
300 



gtttcattta agaagaatga gctagataaa tgtgctcttc tggttacccc aeeetflfl ™ 

s^: £sss r 3393it9 a9act3c ^ ~™ 

iSSS^r ^^"nact tcccngaanc actataattg gnanacnttn ctaannggtn 180 

atctngccga cctgnnagat anactcnnga taaaanccnn tgcagaaagc gcccScat III 

gtcangcncn tnaganacnn ncntaccncc tangna gcccttccat 240 



60 
120 



60 
120 
180 
240 
300 
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<400> 20 

atggcatgca ctgacctctt cttggagccc agaactttat agagttgcct accagggtta 
ctgtaatgga atttatgatc ttaagaaatt actagttgta ttatttatcc tatgattcat 
tcattcaata agcttttact gcataaactt tacatccagc actgtagtta agtacccaaa 
attgaataga aataatggct tttgaaaatc gcacaaagca ggccaggcac ggtggctcac 
gcctgtaatc ccagcatttt gggaggccga ggcaggcgga tcacgaggtc aagagatcca 



<210> 21 

<211> 293 

<212> DNA 

<213> Homo sapiens 



<400> 21 

cgtctgtaat cccagctgct tgggaggctg aggcaggaga atcacttgaa ccctggaggt 
ggcggttgca gtgagcacag atcatgccac tgcactccag cctgggcaac aaaacgagac 
ttcgtctcaa aaaaaaaaan nnnnnnnnnn nnnnccttng gncgggttnt cccaaattnt 
tttgaggngn C catggncaa ctgcttnanc tttgttttgg caaccccntg cccnaagtcg 
catataggct gtncttcacc ttgtttccaa ggctgnggaa canaaagtaa cct 



<210> 22 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 22 

ctggtctcga acacctgacc tcaggtgatc cattcgtctt ggcctctcga agtgctggga 
ttccaggcgt gagccactgc ggccagcaca tttccacttt tagatcctac tccataccac 



60 
120 
180 
240 
273 



<400> 18 

ctcagctgag gcaattaaac tggaaaagaa atagattgaa aagatactac agaagaagca 
gtacagaagt tgggggactg aaggagaggg agccactgca ggtgctagct gcttaagggg 
ataccagtcc ttttacagat ataatagata cagcttctga ggtggagggt gataggagtg 
tgtagagaaa ttgcagttca gaactggagc atgcagttag gcaagaggca tcccatgtga 
agatgtcaag caagtactgg aaaatgctga act 

<210> 19 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 19 

gggtcctggt gggagttcca tccagcagtg agtgcatttt ttccccagag cagttaaggg 

tcttattaaa agccaccact ttgctgaggc ctgtacaggc cttgggggtt tggggaagag 

aaataaggca ggcacttgtc ccttcaggga gggacttgtc cctcactggg aggtttgggg 180 

ttgaccttgg ctccagcaga gatacccagc ctggcgtgga aggggcaggt ctgagcttac 

gcttgactgc agggcaagct gcaggcctct tctgccttcc cctgcattca ccaaggacag 

<210> 20 
<211> 300 
<212> DNA 
<213> Homo sapiens 



60 
120 



240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
293 



60 
120 
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aggtttcatt taagaagaaa gagctagata aatgtgotct tctggttacc ccaccctgac 180 
gSatattta "'—PSSre tagcaggggt tgagactgca gccggcctg ccagccattg 240 
gaggtgttta aggaagggca gataatgtga ctctttgcgg ggtgccatct gcttacccat 300 

<210> 23 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 23 



ttlllT, 5 C9t ^ at ^ gtgttctctt gtccttatcg acttgctctg ctcccagctt 
tccaagcgac cggatctgag tgatgcttct agaacatttg ggtgttgggg ggttcccaat 
agtagaaagg gtccccattc ctgctcagca ccgcacctct HaLcSS acagacacac 
atgcagacac acacatgcag acaacacgca gacacacaca tgcaggcact cacatgcagg 
cccatgcaca cacacgtgca cacacatgca gagacatgca gacacgcagg cacacatgca 



<210> 24 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 24 

TaTaltaTr. t aCg t 3 " aa ttcaa ^atga gatttggttg gggacacagc caacccatat 
cacccatgcc tggatgccct tctcatgctt gggttctgtc atctgcacca ggccttctgc 
aacSoctf ! Ctta ! CCa ? «OT-ctctg actctccacg ctgggccacc Scttctcc 
aacactgcta tggattgaat gtttatgtta tccccaaatt tgcatgttgc aatcccaatc 
tccaatgcca tagtattagg aggtgggggc ctttgggagg tgatttggL atgaaggtgg 



<210> 25 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 25 

?? aaaat9aa atctgactat ctgctagttg ccaaaaccca gaaacattcc tgtgtaatgg 
ttagttggga aagaaggcag cacttgaaaa aatttaccag gttcctcact gggagatgS 
ggaaggggcg tgggacgcac gcggtcactc cctctcagcc ccccacattt cSgaacaca 
Ittll 9 9 gcctctacag actcccgctg cctggcctcc acagatcctg ctcagattca 
ccagtaggca aagcttggcc ctattagctt tttctctcca tggctctgtg ggaa^gtgcg 



<2X0> 26 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 26 

tcccS? 39 a " CtCtgCa "gactggcc tcagcaaggg ggcagcttag gaccctgaca 
tcccaggtca ctaagccaca taggataagt aatgggtgga cagaagcggg Laggagaag 
ggcagggcac atgtttaaaa cttgaacttt ctgaggctaa gactggaaaa ggaEgXt? 
cagctgatat atttggatac cagttgacta tttttaggaa LaaSacaa atggcSta 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
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aacatcacag tgtgatacag tctaactcag aattagagac aggcaaaaca gaactccatc 

<210> 27 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 27 

gtactgcttc tgtggctctt cacagacctc acggatgtga ccggagatga gtgccgatga 
ccacgtttta aaggagaaag agagctcctg gtggggccct cggggtggtc tcaggtccca 
tttgcagtct gcaacagtga cgcgcagccc ggtccggagc gtggtgagct ttgtttgcct 
tctgggtcag ctttcgctgt gtctcctgtg tgtgttagaa tccagagccc agaggaagtg 
caagcgggtc ctccgccaac ggggagagcc tcttcgcggc gctgttggcg acagcacgct 

<210> 28 
<211> 298 
<212> DNA 

<213> Homo sapiens 
<400> 28 

aangnaannn nggggngttg antcnacctn ngaaccgtgt anaaacccat ggaaacagct 
antaganntt gggcagganc agagngaggc caagntacgg gggaggcnag gagcngagan 
tggggnnnnn nnangnnaan tnnngaaggg gngngannga gggggggana naagggggga 
ngagggcgaa ngncaggann nagaaaannn ggggacgana nggngaacag ggnnnaaacg 
gaannnngga gnnnnnanag atgncgggca gngncngngn aggnganann ngagacgg 

<210> 29 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 29 

cctcagcccc acaccagctc tatttcaggg gtgagagtca gagagcactg caatatgtgc 
ttcatgggat ttcgattcga agatcctaga ccagggagac actgtgagcc agggatacaa 
caaaatacta ggtaagtcac tgcagaccga cctccctgca gtttgggaaa gaagctgggt 
ttgtggagaa tcagagcatc ttgacatgac tgctgaccta aagatccctg gcattggcca 
gggatcctgt ggaacctctt ctagttcagg ggtgtgagca ttagactgcc agttgtctag 

<210> 30 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 30 

gtttgtttcc ccgagatgtg aacttgctga aggaaaacag tgtaaagagg aaggccatac 
agagaactgt cagctcttca ggatgtgaag gcaagaggaa tgaagacaag gaagcagtga 
gcatgttggt taactgccct gcctactaca gtgtgtctgc tcccaaggct gagctactga 
acaaaatcaa agagatgcca nnnnnnnnnn nntgaggaag aggaacaggc anatgtcaat 
gaaaagaagg ctgatctcat tggaagtctc acccacaagc tggagaccct ccaggaggcg 

<210> 31 



300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
298 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 31 



acccctctcg 


ggtagagttt 


tcatggtgga 


60 


tttttggcac 


agaaggcctg 


ggccattttc 


120 


gaactccgta 


ggttgttgcg 


ttcactgcag 


180 


aacggagcct 


cccccatgca 


gcccccactc 


240 


acgggggcgt 


ttctcaccgt 


ctgaatgcgg 


300 



<210> 32 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 32 



gtgaaacaga aagtggagat gctttccttg acctgaagaa gcctcctgcc tccaaatgcc 
ZatT^T taCaaaa * aa 9-ctcttgg atataaaaga actcccccat tccaaacaga 
ScctS C ! tttCtgaa — t«tg«ca gtgatggtgt ctgggaccct gagaagtggc 
atgcctctct ctacccagct tcagggogga gctcaccagt ggaaagtctg aagaaagagt 
tggatacaga ccggccttco ctggtgogca ggatagtaga tccacgagag cgtgtgLag 



<210> 33 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 33 

gtctgattga agctgttcag gtttatcatg caaatcctcg cctctggcta cggctggctq 
aatgctgcat tgctgccaat aaggggactt ctgaacaaga aactaaaggc cttcccagca 
aaaaaggaat tgtacagtct attgttggtc aaggctatca tcgtaaaata gttttggcat 
cacagtctat acagaatact gtttataatg atgggcagtc ttcggccatt cctgtagcca 
gtatggagtt tgcagccata tgtctcagaa atgccttgtt gctgctacct gaagaacagc 



<210> 34 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 34 

tgacagagct gttcagcgta caccagatcg atgagctggc caagtgcaca tcagacactg 
tgttcctgga gaagaccagt aagatctcgg accttatcag cagcatcacg oaggactacc 
acctggatga gcaggatgct gagggccgoc tggtacgcgg catcattcgc attattaccc 
^? ag r g tgCtCgCCca cagacctcgg agggtcgttc aactcgggct gctgccccaa 
ccgctgctgc ccctgacagt ggccatgaga ccatggtggg ctcaggtctc agccaggatg 



<210> 35 

<211> 300 

<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



9 



BNSDOCID: <WO 9933982A2J. 



PCT/US98/27610 
WO 99/33982 



cttttttaag caaagcagtt tctagttaat gtagcatctt ggactttggg gcgtcattct 60 
taagcttgS gtgcccggta accatggtcc tcttgctctg attaaccctt ccttcaatgg 120 
actSttcac ccagacacca aggtatgaga tggccctgcc aagtgtcggc ctctcctgtt 
Scaaaaac attctaaagc eStgttctt gcttcatgga caagaggcag ccggagagag 
tgccagggtg ccctggtctg agctggcatc cccatgtctt ctgtgtccga gggcagcatg 



gggaaggagc gggcgtgagg ccagctgagg catggtgacc cctgggaagg *gcgggcgtg 
aggccagcig aggcatggcg acccctggga aggagcgggc gtgaggccag ctgaggcatg 



180 
240 
300 



<210> 36 
<211> 300 
<212> DNA 

<213> Homo sapiens 

actaaccaargccaaatctc ctaagtccac cgcccaggag ggaaccctga agcctgaagg 60 
agttacggag Iccaaacatc cagctgcagt tcgcctccaa gaaggggtcc atggccctag 120 
SaScat Jtgggctctg gggaccatga ctattgtgtc cggagcagga cccccccaaa 180 
aaagatgcct |ccctagt:ca Sccagaggt gggctcccga tggaatgtca agcgccatca 240 
ggacatcacc Lcaaacctg tcttgtcctt gggcccagct gcccttcgcc ccatgcatag 300 

<210> 37 
<211> 300 
<212> DNA 

<213> Homo sapiens 

atccaaqgaraacttcgaga catttctttt tgccaccgta tctaacaggg agcaggaaga 60 
Stctgccga ggaattgtcc agctctgctt caatgagcaa agcoaacagc tgctagcaga 120 
ggtccagccc StgacLtt tcctcatggt agagacaact gcatactttg aggcctacag 180 
acacqSctg gaaggaotcc aggaggtcoa ggaggaagat gttcccttcc agaggaatat 240 
cgtggagtg? aacStcatg tgaaggagcc aaggtacttg ctaatggggg gcagatatga 

<210> 38 
<211> 300 
<212> DNA 

<213> Homo sapiens 

catccagggrgaacctcggg gctgggacac ctcctggccc tcaccctggg tcatgtttac 60 
agtcctcagt gccccacacc ggtggccccc tgaggacacc tccaccctga ^^tgatttt 120 
cccaaacgct Lctcttggt gacagactca gcccaaaacc ccttccttct gtctctggag 180 
acccttgagc JtggggaSt atggaggggt gtgtgtctgc aatcaaggcc tctgcagctc 240 
acggctggcc cggtgggctg ggacttccgt atgaattnta aatacttagg gttcttttt 300 

<210> 39 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 39 



60 
120 
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gtgacccctg ggaaggagcg ggcgtgaggc cagctgaggc atggtgaccc ctgggtacgg 180 
gggacttggg ggccgcacct tggtttgccc agggcccctc ctgcaccacg ggccacatgc 24 0 

ggaggacggc gtgggatagg ctccctgggt ccacagcttc tgcccgtgta tggggaaccc 300 

<210> 40 

<211> 300 

<212> DNA 

<2l3> Homo sapiens 



<400> 40 



ccaaaagctt gtggcaaatt tgaaatttct gccattaggg accttacaac tggctatgat 
gatagccaac ctgataaaaa agctgttctt cccactagta aaagcagcca aatgatcacc 
ttcacctttg ctaatggagg cgtggccacc atgcgcacca gtgggacaga gcccaaaatc 
aagtactatg cagagctgtg tgccccacct gggaacagtg atcctgagca gctgaagaag 
gaactgaatg aactggtcag tgctattgaa gaacattttt tccagccaca gaagtacaat 



<210> 41 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 41 



aaaaggtccc ccttctggga aagaccgagt gaagaaaggt ggatcctaca tgtgccatag 
gtcttattgt tacaggtatc gctgtgctgc tcggagccag aacacacctg atagctctgc 
ttcgaatctg ggattccgct gtgcagccga ccgcctgccc actatggact gacaaccaag 
gaaagtcttc cccagtccaa ggagcagtcg tgtctgacct acattgggct tttctcagaa 
ctttgaacga tcccatgcaa agaattccca ccctgaggtg ggttacatac ctgcccaatg 



<210> 42 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 42 

ttctaagtca ggagtacagt acaaaggaca tgtggagatc cccaatttgt ctgatgaaaa 
cagcgtggat gaagtggaga ttagtgtgag ccttgccaaa gatgagcctg acacaaatct 
cgtggcctta atgaaggaag aaggggtgaa acttctaaga gaagcaatgg gaatttacat 
cagcaccctc aaaacagagt tcacccaggg catgatctta cctacaatga atggagagtc 
agtagaccca gtggggcagc cagcactgaa aactgaggag cgcaaggcta agcctgctcc 



<210> 43 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 43 

gccaccgaag cttcaggatg acatcttaga ctctcttggt caggggatca atgagttaaa 
gactgcagaa caaatcaacg agcatgtttc aggccccttt gtgcagttct ttgtcaagat 
tgtgggccat tatgcttcct atatcaagcg ggaagcaaat gggcaaggcc acttccaaga 
aagatccttc tgtaaggctc tgacctccaa gaccaaccgc cgatttgtga agaagtttgt 
gaagacacag ctcttctcac ttttcatcca ggaagccgag aagagcaaga atcctcctgc 



60 
120 
180 
240 
300 



60 
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180 
240 
300 
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<210> 44 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 44 



ggcttataca^catagtggg gaacgcatgg gaatggactt cagactggtg ^tgttcat 
cattctattg aagaaacgct taacccaaaa ggtccccctt ctgggaaaga ccgagtgaag 
Ull^Tsll ccLcatgtg ccataggtct tattgttaca ggtatcgctg tgctgctcgg 
aqccagSca cacctgatag ctctgcttcg aatctgggat tccgctgtgc agccgaccgg 
Sgcccacta tggac^gaca accaaggaaa gtcttcccca gtccaaggag cagccgtgtc 



<210> 45 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 45 



gtggaagaaa attttttgct gcttctggtt cccagaaaag ggagccattt taacagacac 
atSgtcaaa agaaatgact tgtcgattat ttctggctaa tttttcttta tagcagagtt 
tctcacacct glcgagctgt ggcatgcttt taaacagagt tcatttccag taccctccat 
caatacaccc ScttLaga aLtgaactt atgcaaatag acatccacag cgtcggtaaa 
ttaalgggtg aSaccaalt ttcataatat tttcccttta taaaaggatt tgttggccag 



acacagataa ttttaataca atgtgaaaaa gtgtatgggt gtgtagaaga J^ttctta 
gagtttctgg agagaatgat tctgagctcg gttttgacaa aagaggagct ^tgaggcta 
aaaataaatq aaaagggcct tataattaaa agaaacaaga caggactcag aggtgtgaaa 
T-t^a. «acaa tg a g tt^tat t=t ? ta ? ccc «a««=a, 
ggtataaaga gacagaaaat gatcctggaa tatagacaga ggatacttca tctctcatga 



<210> 48 
<211> 300 
<212> DNA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



<210> 46 
<211> 300 
<212> DNA 

<213> Homo sapiens 

ataaaaqaaa^tttttgct gcttctggtt cccagaaaag ggagccattt tangngacac 60 
atSgtcaaa agaaatgact tgtcgattat ttctggctaa tttttcttta tagcagagtt 120 
tctcacacct ggcgagctgt ggcatgcttt taaacagagt tcatttccag taccctccat 180 
caScaccc ScttLaga aaatgaactt atgcaaatag acatccacag cgtcggtaaa 240 
ttaaggggtg atcaccaagt ttcataatat tttcccttta taaaaggatt tgttggccag 300 

<210> 47 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 47 



60 
120 
180 
240 
300 
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<213> Homo sapiens 



<400> 48 

gatggaacat gagtggaagt gggcagtctt tttctttccc tatcagctga gtgaatgaag 
atttagaggg cagcagagtc atgacatgga tgacgttggg tc tctggatg gctaaatgga 
agacccgccc cccaacgcca ctctaccccc ctgctttgaa ctatgctttg agaaatgagc 
ttatgagacc actgagactt gggggctgtt tgttcagcag ttcacctaca cttattagga 
aaggttgact tcttgtaact acgcctttcc ttaaatcatc ttttgtataa ttctcagaag 



<210> 49 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 49 

ccctccccgg cttcccccgg agtgggtcac cacactgttt tttatcatca taoaa^r^ 

f C T tg ^ f " gCtggt ^cttcccac tggcgaagag algctacaaa 
atatgctcga tggatagcat tcactggaac cactatgaga agattatagg aaaaacacca 
agactagagg actctgggtt ccttttatgc aaagtcaact cttctgggtc acagttaccc 
agcaacaaaa ataaagagag gaccaggacg atgccagcac cccgtttatc ctgagtgaac 

<210> 50 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 50 

ctcctgtctc agcctcctgg gtagctggga ctacaggtgc atgccaccat gcctggctaa 
cttttgtatt tttagtacag acagggtttc accacattgg tcaggctggt ctcgaactcc 
taacctcagg tgatccacct gccttggcct cccaaagtgc tgagattaca ggcgtgagcc 
accgcgcctg gcctgattgg ttttttaaca tgatttttct ctaagcttaa ataccacaag 
gccaaagaga aatggtcata atttaaacca ttattatatt ggtgaggtat ccctagctat 

<210> 51 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 51 

ggaggctaga ctcaagctgt ctggagagtg tgaaacaaaa gtgtgtgaag agttgtaact 
gtgtgactga gcttgatggc caagttgaaa atcttcattt ggatctgtgc tgccttgctg 
gtaaccagga agaccttagt aaggactctc taggtcctac caaatcaagc aaaattgaag 
gagctggtac cagtatctca gagcctccgt ctcctatcag tccgtatgct tcagaaagct 
gtggaacgct acctcttcct ttgagacctt gtggagaagg gtctgaaatg gtaggcaaag 

<210> 52 

<211> 300 

<212> DNA 

<213> Homo sapiens 



13 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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<400> 52 



agtcatgtct 


ttaatggccc 


60 


cctagtgtct 


ggctttatgc 


120 


ttgctgtgag 


gattagatga 


180 


caagccatgt 


atccctaatc 


240 


agtttttgct 


caagattgca 


300 



<210> 53 
<211> 300 
<212> DNA 

<213> Homo sapiens 

«aaaagc"r g Stggtact actattatga acaacattgt tatttggaat ttaaaaactg 60 
atcaactcct gaaaaagatg cacattgatg attcttacca agcttcagtc tgtcacaaag 120 
cctattctga aatggggctt etettuttg tcctgagtca tccctgtgcc aaagagagtg 180 
agtcgttgcg aagcSgtg tttcagctca ttgtgattaa ccctaagacg actctcagcg 240 
tgggtgtga? gc^gtac^gt cttcctccag ggcaggctgg caggttcctg gaaggtgacg 300 

<210> 54 

<211> 300 

<212> DNA 

<213> Homo sapiens 



ccaagateerJitteeatg aagtcttgat ttatatatat gtacacatgt tatgcacata 
SgKgtt ttctaacagt tattttttaa gcttttgaga taattttaga c«acagaag 
agttgtaaaa gtagtagagt tcttgtatac tctgcaccca ccttgccctt ^gttaacat 
cttacgtaac aatagaacat ttgtcaaaat taagaaatta accttgatat ^^ctaact 
aaagtagaaa gtttaaaaag tagagatttt agtcttttca ctaatgtcct tttactgttc 



ctttgctrt^tcLtteeaa gttgttctct gttctagaaa gcagatgtag tagacatcta 
ctgtttttga ctaaacagaa tccctttttc ctttttttgt taaaagtact catccctaat 



€0 
12 0 
180 
240 
300 



<210> 55 

<211> 300 

<212> DNA 

<213> Homo sapiens 

gggagggacrcttgggggca ggttgtgggt agccagttgc agtctgtggc ^^caga 60 
ggSggagt cgggcgtggc atgotgctgt tggcctcttt ccgagggagt ^catccact 120 
ccctgtccca cogctgtccg cggtgaggac agtgagggca gtgctacgtg gtggggaggt 180 
gtgtgagaag ccacggaagg gcttcacagg gcagatgcca aggccagtgg ^^cggaca 240 
gagtcaggct ccctgggcgg ccttgtgtct tggtggccct gatcatcctg ccaatgcaaa 300 

<210> 56 

<211> 300 

<212> DNA 

<213> Homo sapiens 



60 
120 
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attacattgt tctggaagga ctgaaaataa cagaactcag caccatgatc ggaccgggac 180 
cttcttta^ " tCatt ; C J «~acgg agatcgatcc gaaaag^gga Statga^ct 240 
cttctttggt gttggcatat ggaccctgag agaaagaact ttaatttttt ctcttggact 300 

<210> 57 
<211> 276 
<212> DNA 

<213> Homo sapiens 



<400> 57 

cctccctgga tgtgcagaca tggaggagga cagaaggccc agctcagtgg cccccgctcc 
t^tt?? 0 * C * C «"W« gcaggggcag agccagnnnn nnntcgaagt gtgtccnngt 
tgtcttttga nccttgttnt ggngccttgc ctanatgtat ntnntntnnn LLntaaJt 
tnnnnnntnn ntnnnttnct nttnntaaat tgnttnnaan ttntnntann ttnttanatt 
nnnannnnnn ntantgtnnt gnattgntat nnatca 

<210> 58 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 58 

taStttcta !"^ CttgCC «te.cc.« tccctagtac tgggtatcag tctggccact 
tggctttctg gtttgcccca atgtggtcta ttcttgatgc agctaccaaa gtaatgtttt 
lltltl t taccaa ^ tta ef tcettgt caaaaccccc agtaactgcc Ltctcactt 
agaataaaat ccggactcct gtgaagcaca gcataaactg gooactgcct atgcagcaac 
ctcatcttta ccgtttcctg ccttgctcac tcccttccag cgccgtLtt ctLctgatg 

<210> 59 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 59 

gacccaggta gaccagctca agagttcatg ttctttgtca tcctngtgtg agctctctgt 
tSctSL" CttgCCCatC — tccc tagtactggg tatca^tctg gccacttggc 
ttltl f ^ ccc aatgt ggtctattct tgatgcagct accaaagtaa tgttttaaaa 
ccattatacc aagttactat ccttgtcaaa acccccagta actgccaatc tcacttagaa 
taaaatccgg actcctgtga agcacagcat aaactggcca ctgcctatgc agcaacctca 

<210> 60 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 60 

gggtcctggt gggagttcca tccagcagtg agtgcatttt ttccccagag cagggtaagg 
aaa^L aagCCaCCaC "tgctgagg cctgtacagg ccttgggggt ttSg«S 
gaaataaggc aggcacttgt cccttcaggg agggacttgt ccctcactgg gaggtttg^g 
cacttaacto *f ^ce«g cctggcgtgg aaggggcagg tctgagcSa 

cgcttgactg cagggcaagc tgcaggcctc ttctgccttc ccctgcattc accaaggaca 



60 
12 0 
180 
240 
276 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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<210> 61 

<211> 292 

<212> DNA 

<213> Homo sapiens 

<4 ° 0> iLr a tcc cctctgggaa gcagaagcct ggnggcaccc agagtgggta 60 

acatgaaaac tcacaggcag caggaggaac aacagaggcn »Wt«g aaggoggc 
tngcagctga gatggcagcc acgattgaga ggactacggg gcccgtgggg gc 

<210> 62 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<210> 63 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 63 ^ nf-antacccc 60 

120 



ssss -™ ™r, g s— * ~s 

ScLaal gatgctgagg goatggggcc aggaccaggg gagaggcac. gotecttcel 
|agcag"*c SalcaS4= cacaggotc cctaatgctg gtctctgctc caccccccgg 



180 
240 
300 



<210> 64 

<211> 294 

<212> DNA 

<213> Homo sapiens 

Sg?cctcc2 ggaagggcag tcccacaggc agcggcaccc atttctgggc cccg 

<210> 65 
<211> 300 
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<212> DNA 

<213> Homo sapiens 

<400> 65 



tggttgaSt tttttaaJ^ ta ! Cttttca "atgagaca aaggttctga ttatgcctac 
cggctgaaat tttttaatct agtcaagaag gaaaatttga tgaggaagga aggaatgcxat 

S f!S! CgC ! catggataga ttccattcL ac^LlgS 

tataaalaae t * , gagttgactg gtagatttgg tggtagttgc tttctcggga 

tataagaagc aaaatcaact gctacaagta aagaggggat ggggaaggtg ttgcacattt 



<210> 66 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 66 



agcagatttg tgataaactt actotao 3aCT aaa ^, a , ^ . . 

Nectar a93tgCtgCa gatit " at; ££££ 
t ° aaag9Ctt ^ ^aaaagcac tcaagccagc taatatgtta gaacggctaa 

t^acStK g tc ag :r gg aCtaaatatC ggtgccLga Iggctgccgt 

taaacttttt atctggtgag aagtttaaag aatgtttgga taagttccta aggatgaatt 



<210> 67 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 67 



tgttcttgta gtgtttgttg ctattgttag aaagattatt agtgatatgt ggggtgtctt 
anctaaacaa cagacacatg taagaaaaca ccagtttgat catggagagc tggtttacca 
tgcattgcaa ttgttagcat atacagccct tggtatttta attatgfgac tSaactctt 
ZtltT^Z CaCa " gtgtg tta ^atc actgatctgc tcaagacagc tatttgga^ 
gctcttttgc aaagtacatc ctggtgctat tgagtttgct atattagcag caatg^caat 



<210> 68 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 68 



Sctgcc 33 ttS? 9 tCatagaa ^a gttagtagta ggttatgaaa cctctctaaa 
tfcS^ " a " taacc ccaatgatga tggaaaggag gaaccaccaa ccacattact 
ttgggtccag tactacttgg caoaacatta tgacaaaatt ggtcagccat ctattcxcttt 
ggagtacata aatactgcta ttgaaagtac acctacatta alagaactct ttctcgtgaa 
agctaaaato tataagcatg ctggaaatat taaagaagct gcaaggtgga tggatgaggc 



<210> 69 

<211> 300 

<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
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180 
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<400> 69 ataa a tt tta ccttttaaac atccggctgc ctgtgaatga 60 

aattcnacao g*ggtggccc atccggttgg tggggatcca 120 

gaagaataaa atcaatgtgg gaattgggga g » acctttctta cgcccagcat 180 

ccacaatgga ggcttcacca aggcgtggtt ^catgaag ccccagtgct 240 

cttc.tc.tt atggtgtggt attggaggag ^caccatg ^ ^ tcccagtagg 3 00 
tctggaaaaa gtcatctttg cccttgggat ttccatga 



<210> 70 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 70 aaatcaacc t gggccaatca tcaaagggtt ggacctaagg 60 

cccaaggcaa gctgttaaca aaat " a ^ a a * aaatttct cgtaagttgt tgctttettt 120 

tt gctatact caatagaaca agcattttaa J^attt^ tagctcgggc 180 

atgtggtggg tgtggcttta aagagcac aa aatqattc atggacttgt gaatgccagc 240 

ttgtcttttg «gSS acScc-aggg ccaLaaatt cacgcaacaa 300 
cccagtcccg gcttaggtct atcugcw 



<210> 71 

<211> 300 

<212> DNA 

<213> Homo sapiens 



ZEES SSS »• 

<™2 S3= -™ 9 " 3CCatc9 300 



<210> 72 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 72 catctaactg attttgttac 60 

ggattctttc actgagcaoa aagagttgtt 9»**^ "attXcag agaaacctac 120 

Sggttgatt ctgaccatag gaagtatgca atgtgaatca ctat g ttataagaaa 180 

aacagatgct tgatgttgta gaaactggga cata^gata ^gtacatgat cagtttggtg 240 

sss sss cagtctcaat tttaactaaa 3 



<210> 73 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 73 n-tetttatt tgaaccctat accaatatct 60 

ataacacaca tcacagtatg ctctcagaaa ^ttctttatt * aagggtagtg 120 

gttgatcaat gaccattttt gctcagcatg gagaaacagt gc 
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agaataaaaa ggatcttacc aectt-i- a <-r,= «- 

ttgttctctg ttctagaaag Jaaata^* t9a9ggtggc tttgctctct ccattccaag 180 

t ^i tgt i :~ =: = ,? £ 



<210> 74 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 74 



cagagtcaac atggagcatc tMrf^,. _ 

tgtttatagg ttfct?tjaa ag'alatat UlttT^ g * ttgaagg * tatggtaaaa 
caaaactctc ttcctttagg gctict^a !!!* tgtCtt ggtttt 9*99 atattggata 
gaaacaactt gcttccaaS tacccaStc tatl* ^ gatcatcaga aatttcacca 
cact gnnnnn _ gan ncntgctgct - = attcatggag a =gg ^ 



- ~ v / j 

<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 75 



<210> 76 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 76 



gctagacgaa gtggtgaagc ccaaggactt attttt^^ - . 

cacgtactcc ttcctgaaac cactSgagg aaaaatatS ^ gCtgtaa * ^tgagaaat 
aggtgatatt aaaattgaag tcttctatS aaaaatgtct ^tgacactgc atacagatgt 
tcgctgtgtc ccccaggctf gagtacaata f! ggaCaCCC Weatgtg agatggagtc 
tactgggttc aagcaagtct tctgcctcaa tC ggCtCaCtgc aacctccgcc 

tctgcctcag cctcccgaga actgcaagag gaggcaactg 



<210> 77 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 77 



Sttgcacact ggcJtttct t|ct«^c ° ttt3 «« t ««caccca 

=™ 9t ijss - = « =: :=s 



60 
120 
180 



caagagagag tgatagaatt ggcagtaaaa bh,^ 

cacaatacgt gtacacttga ctgtgaSta 1 ac ^tcctgc cctctgggtt 60 

gaccctcagc ctgcggat^c ctctlgSal S^SS Sttcttcttt Jo 

aaccctctgc tttccatcct gcggcacgca acSS? "gcaggagg agtcggaatt 180 

agaaatggat atgagatagg Lcaataaaa gt^aT^ "0 

CUCa Stgcaaaaaa taccagcgaa 300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
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*«oatcccct tggaccccct ctcttccctc cagatcccag 
ttctccatgg cctcagccac agggtcccct egg 



<210> 78 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<210> 79 

<211> 278 

<212> DNA 

<213> Homo sapiens 



<400> 79 - Actaacaaqg aggaggaaga ggaagatgat 

gtgetgeaga ggaagacagc ctgtcaggat ga f agiccat ccctgccctg 

latcaggctg aatacgaege o.*t«o* LJgStcct gecattattg 

IcagagacL ttcagggect gggtgctgct cagcccag 



<210> 80 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<210> 81 

<211> 300 

<212> DNA 

<213> Homo sapiens 



300 



^ZITZ^ ™S 

~S=a t gac cacgatatc ££££ accagcattc 180 

aacagcagca ggtteatcag gtgtacatca 999 3 tcttttagga aagaagatga 240 

2S£= S5S£ SSS "™ aeaggaggga 300 



<210> 82 

20 



60 
120 
180 
240 
278 



<400> 80 ctcttatact ccaatcccac 60 

gg aac ttCt g agtaattggt atcatttcct agtgactcgg ^ ttctggga gg 120 

SSSS SSS SS= SSjs 5SSS 2=2= - 

Si =s= ss= 5= ~™ — 300 



acc t grcr~c t ggc t g r acc g = a getgeeggtg .J 

acctatgetg gactctcccg gcagactgcc ^tatgg | ccggtgccga 180 

agcacctcct cggcacacac gaettcageg ccaagccagc cccttggtca 240 

gccccgtgcg aacgctgcgc og«tct«* ^cccagg * cagtctttC c 300 
cccccgagga gagcaggaag ctgcggttct ggaa 
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<2ll> 300 
<212> DNA 
<213> Homo sapiens 

<400> 82 



occagctgga cctggtggcc ctttcctagt gcctctact-^ « 

acgtggaggc taggaggtct caggtgctgc cch^f 9 ggggagga 9 a acctgggtcc 
gagtgtctgc ccctcggccc tcSggtggg gcalttaS CCa9agtgtg ^cgggccc 
gggcatggcg gtgcagagga gttfggg? 99 S^SS ^T^" aCcaaaa *~ 

gctggc tttc agccccagac ccca ggg t gg ^ = cc^g egg^ 



<210> 83 
<211> 272 
<212> DNA 
<213> Homo sapiens 

<400> 83 



<210> 84 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 84 



<210> 85 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 85 



ctgggagcca ccaacatagc agattacc^ <n-r«™«.*- 

= ssss ~ i 9 2F 
«, tg ,. gt , , 3gtt -: = ~ 



<210> 86 
<211> 300 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 



tcta^atatL gcccaatcgc tgcccacagt qcac^.r^ *.«. 

agggcagatt ttccaaatgc tcatcaccS SgSctat «catgtgag 
gttaggaaat ggcatctcat tgttttcatc tS???^ ^aetata attttggeca 
aacttgtgag gttgagaaac ttttct'ag cttattSc tactcattga „ u 

gaaatggttg ttcatgtcat ttgctcattt ^ tatt9gcc ^"caagttt cctcctttat 2 40 



60 
120 
180 



cccactgccc ccggtcaaca aacccacttt tatgacaatt m^m- 

taaattttac tggcaggtgt atggttgttg gagggttcct aataaa^" gCttggCtCt 

aatagagctg cttggttgga ggaagtgaag ctggct^t ^ 9Sgttgg ^gacctggc 120 

aegtgetget gctttttttg ccactctaat alLl * ^ccagcagct gatctcttcc i 80 

agaagctgtg gctgtttttt gcttttg^gt ggcaataaaf Wgg«taa 240 

*999t ggcaatgaga aagagtcaca gtgtgggtta 300 



60 
120 
180 
240 
300 
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<400> 86 



cggaggtgct 


ttccttgcca 


60 


cccagacagc 


aattatgtca 


120 


ctttttattt 


ttaaaaaatt 


180 


ggatgggtga 


ccactaagtt 


240 


gaactttcca 


ggagttcatg 


300 



<210> 87 

<211> 300 

<212> DNA 

<213> Homo sapiens 

ctc=^»\.tc=.e«c gcagcttgt. aatct.cagc -accccag. 60 

SSSS == SSS = f aS 5==. -J 

fa™ SSS= =S= =SS ss=s = - 



<210> 88 

<211> 300 

<212> DNA 

<213> Homo sapiens 

ctgaaacaa^agatgtattt caattaaaag acttggagaa gattgctccc aaagagaaag 60 

~ USSSJ ssse sees 

S= SS= S= = 2=2 == - 

<210> 89 

<211> 300 

<212> DNA 

<213> Homo sapiens 

gggg acat g rgtccctcagc tcagcagagg ctgtggtaca acatggtcct tggtgaagac 60 
cScacccct ggaacctccc accatcgtca caactgtagt ctcatttgca "J 
aaacccgatg tcccacagcc agatatacac ccagctccat gccagccctt catgtttacc 
KttSttg ttaattacat gtcagactcc tagagggcct ccagactaat aggaagcatt 240 
tctgScc! acSgccacc cactgattca gaaatggaaa tcacattcca caatctatgg 300 

<210> 90 

<211> 300 

<212> DNA 

<213> Homo sapiens 

ctcatacaga'aagtcagatc aacaaagagt ccaagaaaaa tgcgacccag ctagaccatt 
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tgatcccagg cttagcacac gattgcatgg catccccttt agccacttca accactgcag 
acatccagga agctggactc tctcctcagt ccctccagac ttctggccac cacagaatga 
TtlatTl ttCaaCtga ^ «*tctttgc tccagcctga tactccagac tgtgctggfg 
atagtcatac cccactggct ttttccttca ccgaggactt ggaaagttct tgtttgcLf 



<210> 91 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 91 



aatgcaaagg gctgcagttc tcattcaggc tactttcagg atgcacagaa catatattac 
^aaacatg cttcaattct aattcagcaa cattatcgaa catatagagc 
tgcatataa! ZIT^ attatatcag -aatggcat tctgctgtgg ttattcaggc 
tgcatataaa ggaatgaaag caagacaact tttaagggaa aaacacaaag cttctattgt 
aatacaaggc acctacagaa tgtataggca gtattgtttc taccaaaagc ttcagtgggc 



<210> 92 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 92 



aagatatgca gagatattcc aggatotttt agctttggtg cggtctcctg gagacagtgt 
tattcgccaa cagtgtgttg aatatgtcac atccattttg cagtctctct gtgatcagga 
cattgcactt atcttaccaa gctcttctga aggttctatt tc'gaactgg agcagctctc 
caattctcta ccaaataaag aattgatgac ctcaatctgt gac^gtctgt tggSa^ct 
agctaaotct gagagcagtt acaactgttt actgacatgt gtcagaacfa tgftgSct 

<210> 93 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 93 

t?ttct?cat tT,^ C ! gaga9tCa ^cacgttcc tgataggttg tcattgattt 
Httr.lt 9 fc ^ ttttaac cttctaaaca tctccaggcc actttcttag cctttttcta 
latt^t a9a9gtCCta "cacacctg cctcacactt ctcctttoca aggctgcctg 

agtttggagg ggcttgggtg tgtgtgaaca agggccctgc attgtctagg cc^gcagttc 
ccaggcttgg gttcactttc accatgcatt ggcaaaacta gaaaagtafg cttltgfcaa 

<210> 94 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 94 

a^Kr Ct9 agCaCCCaca ^tttcaggat ttagactgtg tggcacctca gctttcetct 
tT, aCtCCtt ^ t *agagaggga actcctcacc aatcccattt gacaaaggct 
SSET CattCt S ctt ^ctttagtc attcttgtca ttgggctgca gaagaaaaac 
aactttgctg ggtgatccca ctgccttgat ttcacctcgg agcgaggctg ggccatgtcc 



23 



120 
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60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
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aagtcttatg aggtcaccct gactagaaaa aattgaactc acctacaaat agtctgaaag 



300 



<210> 95 

<211> 300 

<212> DNA 

<213> Homo sapiens 

gtgagtccgrgcatcagtgg cttctggagc agaccagcca cgtggaagag ^cttaca *° 
aaoataagtc ggcagagccc tgctgatggc tgggccttgt gggcagccac tctgtgtgag 120 
caagSttg g|cccataca cttcaaagac cagagccctg cactgggaga gtgctcctgg 180 
cccaggotal glatcacctt tcgaggccct tcagactctg gcggggcttg ctgtggcctc 240 
cctcSgcta gtggtgtggc tgagcagact ccagggccag ggccagttcc cttctoccct 300 

<210> 96 

<211> 300 

<212> DNA 

<213> Homo sapiens 

acaactccaracataattaa agactggccc aggaggaaga gggcggtggg ctgtggcgcc 60 
ggctcctcS ccgggagggg cgaggtcggt gcagaccttc ctgggagcct ^ctgctt 120 
aaaacagagg gcaaggacca cggccttgaa ctcagcatcc acaggacgcc catcttggag 180 
gaStSagc Lgagggagt gtgccagctc ccagaccagt cgcctcccag gaacagcatg 240 
cctaagScg aggaScctc ttcctgggga cagtttgggt tgagttccag gaagagagtc 300 

<210> 97 

<211> 286 

<212> DNA 

<213> Homo sapiens 

atccaqqgcrcangttttaa tttnttttta aaaagcttta ggtcttgccg ggacggtggt 60 
tcacncSnn nnnnnnnnnn nnnnnnnagg cctaggcggg tggatcacaa ggtcagcagt 120 
tcaaaa^aq cctgaccagc atggtgagac cctgtctcta ctggaaatac aaaaaaattg 180 
gctggg'cgag gtScaggca cctgtggtcc cagctacctg gg aggctgag gcgggagagt: 240 
ctcttgaaac tggaaggcag aggttgcggt gagccgagat tgcgcc 

<210> 98 

<211> 300 

<212> DNA 

<213> Homo sapiens 

caccattttrattttgatgc ttacactcat ttattctgtt tttgtaaaac agtttcggga 60 
atttaaaaat ccttccagtt aatagagctt ttgttattat attataattt tgtaaaccca 
ctttgttttt cccactttaa agccacaggg tcgactcatg gatgatacct ctattgctgc 
tgcatgatgt tcaagaccgg cccttggctg ttgttacaga gatgttgggc agagctatgc 
algtg^ttca ttgtgaactc tagctttgat catggtaaaa agttaaccct ttctattttt 



180 
240 
300 



<210> 99 
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<21l> 300 
<212> DNA 
<213> Homo sapiens 

<400> 99 



agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa aggggacttq 
ttaccaggcc attttetgg. tggcctgtga gatctctgcc cctccaagac cSccaaJtc 

tgSccSg c^ccT gaaccatggg ggcttStc 

IITJSZ f Ctgcagcctc cccaatccca gcccactttg ctgtgtctct ggcgggctgt 
cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc Sgfcctcca 



<210> 100 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 100 



at^ CC 9 gCgatgtCac cacrcatggtg gctcaggtta gagctctctg aggacccagc 
ccataSat f^ 9 ^ 999 gaccccacca ccgtcatcaa cacttacS 

at a t aa «! 9 ^ ° ttCagagtg ccttggcect agacctccct tcattctttg tagagatgga^ 
atctaagaat gaaacatctc cactcagtcc tgcaaatatg gaagttcttj aaatlccttt 
ttttggtaga tacttgtgct ggtattctga gagtcacttt actctgatgg tttg^gat 



<210> 101 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 101 



tafattctat £S T gCacaacatt 9"UUtgc cccctaaaat gtaacttctt 
tagattctgt tgttacgtgc aacactgtat atctctccat agcacttaat cagagtttat 
aattaggcat ctttttgtgt gattatttgg taaatgtcca tatcccctac tagcctataa 
gctccatgac ttctaggtac cotgtctgac tacgtgtatc actggttcta ccgcctaaca 
ttgcctagca cattoattgc ttcacaggca tctgaatatg ggtKataaa ataStgct 



<210> 102 
<211> 270 
<212> DNA 
<213> Homo sapiens 

<400> 102 



SS^S* I gCCCCtCC tar«tctegt atgatggtca cagtccggtg gccgtggggg 
tatltT I ^actgccca tatctgtgga ctgccccttc caafgacccc 

tggaggaggt gtnnnnnnnn nnnttnntgn ncccactacc ntgcactgaa ctggccntgt 
tacancaann actgnncccn nttgttatna cacctntnac aaacacctgc tgSgtacft 
gncnctactt taaggactnn anacctgtgc tgctgtacat 



<210> 103 
<211> 300 
<212> DNA 
<213> Homo sapiens 
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60 
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180 
240 
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60 
120 
180 
240 
300 



60 
120 
180 
240 
270 
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s== £=s =s ss= =S 5Si s 

S= 32S2 == ~S H ™i £ 

agaccccgag gtgcgaagca atgccatctt cgggatgggc gtgctggcag agcatggggg 

<210> 104 
<211> 300 
<212> DNA 
<213> Homo sapiens 

c t cgcg"ctc\tcactgcac attgcaatgc .tttgog.tt coojtttctc ^taggagc 60 
C aacctqc,qt ggcgctgctc ccagagccgt gggtcccaga ccttgcgttc cttttgttcc 
^fccaSta Saggacacg ggccccacct gtcacgtgcc cgaggccacc caagcccagc 180 

S£ 2==S s= =sss s= =s=s - 

<210> 105 
<211> 300 
<212> DNA 
<213> Homo sapiens 

551 5iS SSr. ssss = =s= 

<210> 106 
<211> 300 
<212> DNA 
<213> Homo sapiens 

g = t ca,^r^t g a=c= atc t c=, tg c csa.taeaat acatctacct J. 

a 3 caca t c S = Jg-^ ~£ f/,™ ™ tg «. 

552 S= S== s= e= = - 

<210> 107 
<211> 300 
<212> DNA 
<213> Homo sapiens 

tgtgagtttrctatctgttc cagactagta tcgccaatct ctcccagctc tcttctttce 60 
Scctggcct ttgtcctgca ggaggtagca tcacctcttg gcattttgta catgctttta 
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aacaattgga ggagctgccc aggcagtttt atggcctcct ggttgtgtgc cttcacaccc 

IcZlZl TcllTt?* 3tCaagC9Ct ^caatgc Sgt^gct ggcc'tgagt 
tcctgagtca gctccttgcc agggccagag ctggtaaoag cggggcagca gggtgggtag 



<210> 108 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 108 

aggttgctca cctgaaggag cacaggaggg ttttccaggc catgtggcto aoottr^n, 
TcTalT 1 ; gCCCCtCa * C -tctacaaga aggtgct|ct gat^gcat gfcgccatcc 
tgccgcagct ggcgcagccc acgctcatga tcgacttcct cacccgcgcc tSaccS 

S2ES 9CCttgaaC9 g9CtgttCat Ctt ™ 

cggagtaccc tgacttctac cggaagctct acggcctctt ggacccctct gtctttcacg 



<210> 109 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 109 



Itttlllatl ^aagttaa ttaacattga aagatgagaa gacatcttgg aagaacttga 

gaggSggaa SSSS ST""" attgtggtg caCggcaS 

s agtaggaa agtatagatc tccagggaca gtagtcatgg ggttgggqca ctoi-t n( T«h 
ttaaggttgg aaggatatat tggagcccct tgaatacggt flcalgSc accttSaca 
gtggagagtt atcagagtgt ttgaaaagga gggttattga gtaaaSat £££S 



<210> 110 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> no 



gacaccccag atgcagccac caccagcaga agcgatcagc tgaccccaca agggcacata 
gctgtggccg tgggctcagg tggcagctat ggagccgagg atgaggtgga qSSSa? 

agc^St 9 2S?S ™*" ^™ 

IrantTt a 9agcttctt tgacgtggac acctcacagg tcctggaccg ga^caaaqac 
tcactgctgc cccggcctgg ccacaacttt gtgcggcacc atctgcggaa tcgg^gat 



<210> 111 
<211> 271 
<212> DNA 
<213> Homo sapiens 

<400> 111 



180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



S52 2=5= ~2 2— :~ - J- . « 

tggggggggt ggggnnttcc ttctannccn ntacnctatg tgtttaatnn n^!7 t 

SSS SSSS T~ ~ * «; 
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<210> 112 
<211> 300 
<212> DNA 
<213> Homo sapiens 

attccctcarcttattcctc caagttcccc cttgggaacc tctgagatta acttgataag 60 
ctccttgggc aagctcttta tcctaagatt cctcagtgag ccttatagag ttgctgcgag 120 
aattacaSt gttcatgatg tcaagtgtct ggtatgtagc taatgcttat tgaacacata 180 
ataatttatt Lataa^tgt catgatcact ggatgagata tagccactgt ggaggtaggc 240 
acaccagggt ^ttagaggct tgggatcttg caacaggatt tteetcttgc ctctccaaac 300 

<210> 113 
<211> 300 
<212> DNA 

<213> Homo sapiens 

eccaca^c^ttgag tttgaagatg gatcccagat agcaatgaag agagaggaca 60 
tctacacttt agafgaagag ttacccaaga gagtgaaagc tcgattttcc acagcctctg 120 
acatgcgatt tgaagacacg ttttatggag cagacattat ccaaggggag agaaagagac 180 
aaagaSct gagctccagg tttaagaatg aatatgtggc cgaccctgta taccgcactt 240 
ttSgaagag ctctttccag aagaagtgcc agaagagaca gtagtctgca tacatcgctg 300 



<210> 114 
<211> 300 
<212> DNA 

<213> Homo sapiens 

a=agcta g °g\a"aggatc.: gaatggcata gacttaactc °tgtg=aaga 60 
qcttcaaga. aagaagatac atatgtacat tttaatgtgg acattgagct ccagaageat i 
JS-i. tgcag f atc ™,a C tcaaacac a c.agcot ,a 240 

ESSE SEE SEES ESSE iJ*. — - - 

<210> 115 
<211> 288 
<212> DNA 

<213> Homo sapiens 



atoatctgcc tgccttggtc tcccaaagtg ctgggaatac aggcatgagc caccgcactc 60 

ggccaggagc tagtttStc agcatcctgc tccactgoct tcctctagtg cagcctggaa 120 

gacatggcag ogggtagctc ctggggctga gccagaagca tcactgcagt ^aagtctct 180 

Icttacctgt ctggctcagc ttgggcaagg gctgggccat atgtgctcag ggacgtgctt 240 
ctcttgtaag g Ca ggaggat anaanaggac cannaanggn gggagctg 



<210> 116 
<211> 300 
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<212> DNA 

<213> Homo sapiens 
<400> 116 

tcaattagta acatctgaaa aaacagcttt gtcctgggtg aaaaaggatg ccaaaattgc 

ttllTttTc SSS 9 " 9 9agtCCg9ga 9-tgtgtg.t gcatgfgaag caacattgtt 
taacattcac tgggtctgcc aaaaatgtgg atttgtggtc tgcttagatt gttacaaaac 
aaaggaaagg aagagttcta gagataaaga actatatgct togatgLqt SotoaS 
acagc= t cat gatcacaaac atttaatgcc aacccaaatt «5S££ ctg^^ 



<210> 117 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 117 

gcactttcca gaattctctc atatttgtgg crctqcraatca „ 

cacaaggaaa ggaaagaaga tctggtggaa agctcaggtg g^ga"^ ct^ctc^ac 
tgaggaactg cctcagaagc tgcgatcaca actttggctg aagcccctgc ctcactctaa 
112* f ggCCtCtt 9-taaacca caaggctaag ggctatagac aatggtttcc 

ttaggaacag taaaccagtt tttctaggga tggcccttgg ctgggggatg acag^gtggg 



<210> 118 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 118 



agaacgttct caggttgacc agctgctgaa tatttcttta agggaggaag aacttactaa 
JatiaaS? tgCatg9ata ««*tcttet gcaagcccgt gcagcccttc agacagctta 
tgtggaagtt cagaggctac ttatgctcaa gcagcagata actatggaga tgacxtacact 
gaggacccat agaataoaga ttctacaggg attacaagaa acatatgaao StcSaJca 
cccaggtttg gcatagaaat ggtacccctt gttcaaaatg aacaagaa^ cttagfSg 



<210> 119 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 119 

cagaaaJtJ 3 llttT^ f 6 "*^" ^gtctatag aagacgttaa agaaaacttc 
aatat 3 ^? g " tga3gca ^ccacca cgcctggcca aaggatttaa tgaattaatg 
Jaaac'tSt gCtggg ? Ct * Stattofgg gcctgcattg agactcacat t^tgccatcf 
aaagcctttt aagaggtgga ggttgcggtg agctgacatg gtgccactgo actccaacct 
gagtgacaga gtgagactct gtctcacaaa aaaaataaJJ ccctttaaat latgStaat 



<210> 120 
<211> 273 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 



bU 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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<400> 120 

cctcagcctt ctaaaaagct ggggctacac ccagctgaag aaattgtaac taaagataga 
ttgtttaaag caaagcaaga aacttctgaa gaaatggaac aaagtggaga agcctoagga 
aalcccaaca gagagtgtgc accccagatt ccttgtagta ctcctattgc tactgaaagg 
acagttgcac atttgaacac tctgaaggac cgtcacccag gtgatttgtg ggcccgcatg 
cacatctcat cccttggaat atgctgcagg aga 

<210> 121 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 121 

agaacgttct caggttgacc agctgctgaa tatttcttta agggaggaag aacttagtaa 
atcattgcag tgcatggata acaatcttct gcaagcccgt gcagcccttc agacagctta 
Ltggaagtt cagaggctac ttatgctcaa gcagcagata actatggaga tgagtgcact 
gaggacccat agaatacaga ttctacaggg attacaagaa acatatgaac cttctgagca 
cccaggtttg gcatagaaat ggtacccctt gttcaaaatg aacaagaagc cttagatttg 

<210> 122 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 122 

gttgcaagca gccttggaat agtaactctt ctcatttgtt tgggatctgg ccaccaagtt 
ccagaatgat acacggatca gtgcagaagt tcatcaggct ctcggacctt ^gggctgttg 
gagaaggctt cagcagcaga actgatggtg aaggctcgtg ttctccatcc tcaactttct 
ttgcttcgat catacacaag aatacatttg gaagggcaaa aaaatgaaca ctgtcgttca 
ttgcagccgt gttttgtgac acagatgcac agtctgctgt gaagaccttc tctcaagtgg 



<210> 123 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 123 ^ ^ ^„ 

gtgatttcag cttccaaact ggtatacatt ccaaactgat agtacattgc catctccagg 
aagacttgac ggctttggga ttttgtttaa acttttataa taaggatcct **^tgtt^ 
cctttaaata gcaaagcagc ctacctggag gctaagtctg ggcagtgggc tggcccctgg 
tgtgagcatt agaccagcca cagtgcctga ttggtatagc cttatgtgct ttcctacaaa 
atggaattgg aggccgggcg cagtggctca cgcctgtaat cccagcactt tgggaggcca 



60 
120 
180 
240 
273 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



<210> 124 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 124 - n 
catgctggcc agcatccctg cctgtgcaag ctctggatga gctgtgagcc cctgccaccc SO 
acacccccac tccctgccag cctggcctca gggcctctga tccatgtgca ctggagagga 120 
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lltlant a ^ ccact 9 gggcatttcc acgttaacag cagctgccac tggcaaaaga 180 
agtgactcgc caatggaggc atctcagatg tgggcccagg agtctgggga gctactttga 240 
acagggctat ccattcattg tcccaccaaa ggctatggag cccacccacc atgtgctgga 300 

<210> 125 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 125 

ggtaaattgg ttgaattatt gtattgaagc ttgagctgta gctaaaagta atttaggttt 60 

ttaatttSa ItT^ ta ^gacata acacttttgg gaggttgttg tgggagatgg 120 

S ^ ttttcaaaag ctagaaataa aatttacatg ccttagattt cataaaattc 180 

llllatlt^ ??^" aa9g ^etStatct aacttgtgtt cctcctaagg ttatgtccta 240 

ataactattc ttttaggagt atacttctac tttatagaag gttgctttto tttttaattt 300 

<210> 126 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 126 

tgaagaggag atcggtgacc tgggctcctt atgtgcctga aagagtttga gtttcctgtt 60 

aactccaaat caacagtatt ttcaacaaga aatgtgcaat tgaaatcaag tgctgtttaa 120 

gtgcagctag gatttccaca ggaagacact tgcagtgaac agagttatgg agcagcaaaa ^80 

ScSct ! ttt99a ! aa agagaaaaCa tatgcgttgt «ttttgct£ aattataaaa 240 
taccatcctc tcaaaggtgg ttctaaatta caaaggactt tgatttctag gtagattctg 300 

<210> 127 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 127 



60 
120 
180 
240 
300 



ggtgattccc atgctgaaca gtttgatctc ctgccagagt gtcgggccac aaactgggca 
gcacatcagg atcacctggg ggccttcaaa aatcaaaaat ccacccccag gccatgccct 
ggacccactg caccaggaca agaaatccac cccaggcctc tccccagacc cactgcacca 
ggacaagaaa tccaccccca ggccacgccc cagacccact gccctaggat gtgggggtga 
gaaccaggtg gtgctttgta aagacgtgca ggtggtaacc ccaggccccc Lgctcggaa 

<210> 128 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 128 

tgagctggga gaaggggaga aagtttgtga agaggagatc ggtgacctgg gctccttatg 60 

tgcctgaaag agtttgagtt tcctgttaac tccaaatcaa cagtattttc aacaagaaat 120 

gtgcaattga aatcaagtgc tgtttaagtg cagctaggat ttccacagga agacacttgc 180 

gcgSSt ?taS 9 % C a T aaaaCa -0.tct.tt tggaaaafga gaaaacatat 240 
gcgttgtatt ttgcttcaat tataaaatac catcctctca aaggtggttc taaattacaa 300 
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<210> 129 
<211> 285 
<212> DNA 

<213> Homo sapiens 
<400> 129 

ggaaagcaca aggaaaggaa agaagatctg gtggaaagct caggtggcag cggactctga 60 
ctccactgag gaactgcctc agaagctgcg atcacaactt tggctgaagc ccctgcctca 120 
ctctagggca cctgacctgg cctcttgcct aaaccacaag gctaagggct atagacaatg 180 
gtttccttag gaacagtaaa ccagtttttc tagggatggc ccttggctgg gggatnnnnn 240 
nnnnnnnnnn nnnnnnnnnn nnaggaagat accatttctt gacgg 285 

<210> 130 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 130 

ccggacgcag gccctcgggc aggagcatct ggcagagtgg ggggcgtggc aggcaccctc 60 
ctttgcaggg cgaggtgggg cctctgcagc catcctggac aggccggggt ggcggcagct 12 0 

ttgcccacgt ggaagcgggg tgggtctcac ttgcgtggtg gcccctggcc ccatcttgcc 180 
tgctgcggcc tggggagcag gcgctgggtg gtggttctgc ctgcttgctg ctcgttcccc 24 0 

gggcatgcgt gggcagcggg gggcatgcgt gggcagcagg gggccgtggg cagcgggggc 300 

<210> 131 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 131 

gatctctata ctagtgaaca gtgccagttc cacactttgg acttagaact gttctctagt 60 
tattgtaaca cagaatactg tcaatcccta atttacttaa tgttacttat tggaagtggg 120 
gctgatgaaa tacgcacagg agggaaatct actgtgttta ggcacaggca gccccagtgt 180 
ataaggagat catattccaa aaggttgtca gttggttgtt tgcaacctgg aatgtatttt 24 0 

cctttagaga ccaggttatc catggtggtt aggcccctag agcagctgga aaagatgatc 300 

<210> 132 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 132 

ctcccatgga ggtggtggga atggcaccga gaagtttgat gacagttatc taatggacta 60 
gaggttggca aactttctgt aaatggccag gtagtaaata gttctgcttt tgaaggcata 120 
tggtctcttg cacctactcg aggctgaaag cagctataga caatacataa atgaatgagc 180 
gtgagtgtgt tccaataaga aaaaaacatg gctgtttgct tcggccccag ggttgtagct 24 0 

taccagtcct gtaacagatc acagtttgct cttttggtca caaatacttg aacccctccc 300 

<210> 133 
<211> 269 
<212> DNA 
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<213> Homo sapiens 
<400> 133 

atgctatgcc aaagcctgct gccagctcca tagcctggac ctacagcact gcatggtgga 60 

gtccacagct gtggtgagct tcttggagga ggcagggtcc cgaatgcgca agttgtggct 120 

gacctacagc tcccagacga cagccatcct gggcgcactg ctgggcagct gctgccccca 180 

gctccaggtc ctggaggtga gcaccggcat caaccgtaat agcattcccc ttcagctgcc 240 

tgtccaggct ntgcaaaaag gctgccctc 269 

<210> 134 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 134 

gatggatgag actgttgctg agttcatcaa gaggaccatc ttgaaaatcc ccatgaatga 60 

actgacaaca atcctgaagg cctgggattt tttgtctgaa aatcaactgc agactgtaaa 120 

tttccgacag agaaaggaat ctgcagttca gcacttgatc catctgtgtg aggaaaagcg 180 

tgcaagtatc agtgatgctg ccctgttaga catcatttat atgcaatttc atcagcacca 240 

gaaagtttgg gatgtttttc agatgagtaa aggaccaggt gaagatgttg acctttttga 300 

<210> 135 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 135 

ggcgagcggg aacagctctt gaggagtgag actgcaggag atgtgggccg tgccaaagag 60 

atggatgaga ctgttgctga gttcatcaag aggaccatct tgaaaatccc catgaatgaa 120 

ctgacaacaa tcctgaaggc ctgggatttt ttgtctgaaa atcaactgca gactgtaaat 180 

ttccgacaga gaaaggaatc tgtagttcag cacttgatcc atctgtgtga ggaaaagcgt 240 

gcaagtatca gtgatgctgc cctgttagac atcatttata tgcaatttca tcagcaccag 300 

<210> 136 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 136 

gacttctaaa tatatcttgg atataatagg tgataagttc tgtcaattag taacatctga 60 

aaaaacagct ttgtcctggg tgaaaaagga tgccaaaatt gcctggaaaa gagcagtgag 120 

aggagtccgg gagatgtgtg atgcatgtga agcaacattg tttaacattc actgggtctg 180 

ccaaaaatgt ggatttgtgg tctgcttaga ttgttacaag gcaaaggaaa ggaagagttc 240 

tagagataaa gaactatatg cttggatgaa gtgtgtgaag ggacagcctc atgatcacaa 300 

<210> 137 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 137 



33 



BNSDOCID: <WO 9933982A2J_> 



WO 99/33982 PCT/US98/27610 



ttgacaaatt gctggaacac acttattgtg gtttacccgg ttttaattat gtcagagatt 60 

gcatcatcct tatgcttgtt tacatctata atcttctatg aaatggtggt accaaggggc 120 

gcccaacagc ttttatcccc attcttagag catattcttt attataatga ttatccaaca 180 

tatttcttta attttaatac aaaaaataca tcatttaatt tttgttacat atgaacattc 240 
atttttaaat gctcagcctc aagtgcaggc atttttgagt ggcctgatta catattcctc 



300 



<210> 138 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 138 

ggaaggggag ggttggtgag tcccagacct taaaaataca aggttaagag ggaccccaaa 60 

gcaaaaaatt ccaacccttt tcctcccagt cattgaaaca ccaaaactat tataccggag 120 

ggtgtaatag ttttgctgcc cagttgtggt aggccagtag tggcctccca agatgcccat 180 

gtcctaatcc caggaacctg tcaaaattac cttgtatggc caaaggggct ttgcagatgt 240 

aatgaagtta aggatctttc gccaggaaga ttatcccagc ttgttcagga gggcttgatg 300 

<210> 139 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 139 

gacatcattt tcttattcta gtaagagaaa gtacacagat tcaactttag agaggacttt 60 

tttttttctg gagctaaatc aaggaaggat tatcacgtgg cctcccttga atataatttt 12 0 

gaagctgtga acagtaccat cagtaacatt ttatggacag ctctgatggt ttttatacca 180 

cggcactctt cttacctttg ggggaagcta tctggagtta tgactgatgt gtaaagtggt 24 0 

ttactgttag aatcctggtg tgctaggatt ctgggagagt cactttcagg aagttacctg 3 00 

<210> 140 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 140 

gctgcccagg cagttttatg gcctcctggt tgtgtgcctt cacacccgcc tacagcccca 60 

cctcaccatc aagcgctgag ccaatgcggg tgtggctggc cctgagttcc tgagtcagct 120 

ccttgccagg gccagagctg gtaacagcgg ggcagcaggg tgggtagcct ctaccagcca 180 

gggcagtccc tgaggggcca gcaggggggc tgactgccta gtggctcaac ctcctgaacc 240 

cacccactcc cagcgatgct acccagaacc ccaacggcat gaatcctgca cagtgccggg 300 

<210> 141 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 141 

cccaaactta tcgggggtgc cagaggcaga gtagacaagc cttagtggcc gccatttgtt 60 

gaatatctac tgtgcgccaa gcagtgcgtc acaactttat gaagtaggta ttattatcat 12 0 

ccccatttta caggtgaaga aactgagtct ctgagagacc aacttttcca aggtcacaca 180 



34 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 




PCT/US98/27610 



gaggtgggat ccagcccact tccgtctgac cccaagcccc tgctgttaac ccctgcccca 240 
"gtggggag gttccggccc actctggagt tctctggtct gcgtcagtcc tcaggagaag 3 00 

<210> 142 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 142 

gaaaggtggc gcgcttctca cggctgagtt gctgcgcctg cagacggaag ctccccacag 60 

gcagagctgc ttggatgtgt gagtcatgaa gccagagaag ccccgctcca tgagcagtga 120 

ctccccaggc cctgtgacct ccctcctgtc ttgcagctcc tcctggcacc agtccccagg 180 

gctctcctgt tggtagttcc tgcttttctt cttggaaatt cctcgtggac ctcgagatct 24 0 

ttaccctaaa atagttctgt tgaatttcac cctggcaatg taaattgata gcttatcttc 300 

<210> 143 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 143 

cttggccttg cttctctgag aaaactttgg tcacacctcc agagccaggg tgggtgcctc 60 

cctggaggag ggggctttcc tggttggtgg cacagcagga gtccaggctt tgtaccgtgg 120 

acaccatggg ctatggcaac accttcctca ccatccttcc atgaggacct cgggagagag 180 

tggacatgaa accctttgtg ctctgaagca ttcaacagaa gctttctggt tctgtgccta 240 

tttctttggc acttgagcgt gtttgcaggt tcattacaca catgatgaaa gctctggccc 300 

<210> 144 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 144 

cctgactgag tgcctgacgg tggaccccct cagtgccagc gtctgaaggc agctgtaccc 60 

taagcacctg tcacagtcca gccttctgct ggagcacttg ctcagctcct gggagcagat 120 

tcccaagaag gtacagaagt ctttgcaaga aaccattcag tccctcaagc ttaccaacca 180 

ggagctgctg aggaagggta gcagtaacaa ccaggatgtc gtcacctgtg acatggcctg 240 

caagggcctg ttgcagcagg ttcagggtcc tcggctgccc tggacgcggc tcctcctgtt 300 

<210> 145 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 145 

gccagagcct agaggagaga tcaaagaccc tggccgaagt gaagcccatt ctgcaagcaa 60 

ctgggttccc atggcatgtg gtggccttag aggaggtgtt cagcctgcca ccgtcggtgc 120 

tttggtgctc tgcccaggag ctggtgggat ccgagggggc ctacaaggcg gccgtggaca 180 

gcttcctcca gcagcagcat gtgctggggg ccgggggtgg tcctggnccg actcaagggg 240 

annnnnnnnn nnncncaacc cccgctggac ccccngaanc tggcaagacc ngctgcccct 300 
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<210> 146 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 146 

tgactttgta cctggtccaa gctgttgggg aattgctgct gttgacccag gcaggagtct 60 

gactagagaa caaactaagg ttgctgcaac aaacaaggac ctcttccaag aagggctccc 120 

aggcctggcg cagtgactca tgcctgtgat cccagcactt gggaggccga ggcgggtgga 180 

tcatttgagg ccaggagttc gagaccagct tggccaacat gatgagaccc cgtctctatt 240 

aaaaatacaa aaattagcca ggcgtggtgg cgcctgtagt cccagctact caggaggttg 300 

<210> 147 
<211> 295 
<212> DNA 
<213> Homo sapiens 

<400> 147 

ggnaangcna nngnaggaga nagagaagna ncagtnnagn cccangaaac ccnntgaaac 60 

ccttagaagn cagaggagng aaaggangaa aaananggnn ggangagaac nnannnnggn 120 

caaannaagg anganngnta ggngngaaaa anaanaacaa anggggaaaa ngggaaaaaa 180 

ggcganaaag gnaanannag nanaaggngg aananannnn annagaaagg ncaanaaaag 240 

aagnacaaag aaaaangana anaagnaann annanannga cagagacaag aagga 295 

<210> 148 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 148 

cgctgtgctt gagaccaacc tgacgggtac cttctacatg tgcaaagcag tttacagctc 6 0 

ctggatgaaa gagcatggag gatctatcgt caatatcatt gtccctacta aagctggatt 120 
tccattagct gtgcattctg gagctgcaag agcaggtgtt tacaacctca ccaaatcttt 180 
agctttggaa tgggcctgca gtggaatacg gatcaattgt gttgcccctg gagttattta 240 
ttcccagact gctgtggaga actatggttc ctggggacaa agcttctttg aagggtcttt 300 

<210> 149 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 149 

agtgtcagt.t ttcctaatct cagtccaggt aggaattaag aaatatctca agtgttgatg 6 0 

ctatccaagc atgttggggt ggaagggaat tggtgcccag aaaatgggac tggagtgagg 120 
aatatctttt cttttgagag tacccccagt ttatttctac tgtgctttat tgctactgtt 180 
ctttattgtg aatgttgtaa cattttaaaa atgttttgcc atagcttttt aggacttggt 240 
gttaaaggag ccagtggtct ctctgggtgg gtactataat gagttattgt gacccacagc 300 

<210> 150 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 150 

tgtagacttt atgtcagttc tgtgtagact ttatgtcagt ttttgtcatt atttgaaaat 60 

ctattctgac aactttttaa ttcctttgat cttataagtt aaagctgtaa caactgaaat 120 

tgcatggatc aagtaagcat agttttatcc agggagaaaa ataaaaggaa gccatagaat 180 

tgctctggtc aaaaccaagc acaccatagc cttaactgaa tatttaggaa atctgcctaa 24 0 

tctgcttata tttggtgttt gttttttgac tgttgggctt tgggaagatg ttatttatga 300 

<210> 151 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 151 

gcgggcccgg ccagcggaag cccctgcgcc cgcgccatgt caaagaaaaa aggactgagt 60 
gcagaagaaa agagaactcg catgatggaa atattttctg aaaraaaaga tgtatttcaa 12 C 

ttaaaagact tggagaagat tgctcccaaa gagaaaggca ttactgctat gtcagtaaaa 180 
gaagtccttc aaagcttagt tgatgatggt atggttgact gtgagaggat cggaacttct 240 
aattattatt gggcttttcc aagtaaagct cttcatgcaa ggaaacataa gttggaggtt 300 

<210> 152 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 152 

gatattcaca cagtatgtat tatattaacc atatcacact taagttatta aattcagact 60 

atttgtaact tattgttata gggcctgccg tatggcttag gatatttgag taatcatata 12 0 

tttaaagtaa aaactttggg ctgggcacag tggctcacac ctgtaatccc agcacttggg 180 

gaagctgagg tgggcagatc agttgaggtc aggagttcta gaccagcctg gtcaacatgg 240 

cgaaacccca tctctactaa aaatacaaaa attagctggg cgtggtggca cacacctgta 300 

<210> 153 
<21l> 300 
<212> DNA 
<213> Homo sapiens 

<400> 153 

cagagaccag ccttctccag aggctgtcac tgcaggagcc gtgggcctgg gaagacttgg 60 

aagcggcctc tctcaactgg tttctgtctc cgtggagctg gaactgcctg cacttgcctt 120 

cagagggagg cacagtccac ccagatccac ctttccagca agacccccag tggctgccca 180 

gcctgggagc acctctttgc ttttcacacc aaaccaaaac tggcgagagc ccctcctagc 240 

caccagtgat ccccaagcat ccagtacaga accaggcatc gagctagctc cctgcacggc 300 

<210> 154 
<211> 300 
<212> DNA 
<213> Homo sapiens 
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<400> 154 

cttgacctct gtactttaaa gaaatcacta accaaatttt caaagtttcc ttttaaatgc 60 

gtttagctag aaatctatgt atttatccct ttcctatttt gcattcttct cccactattt 120 

ttaaaaactc atttacagta gaaaccattc ttctttctcc caacagtatc ctttgccaag 180 

accatgagaa cagtatggga gcatgttgtt ggtcagggtt tcagaatacg cgtgatgtca 240 

ctgagaatgt ttgctcacag tcaataattg tctttgtgga tgtgataatt ttggagatac 300 

<210> 155 

<211> 81 

<212> DNA 

<213> Homo sapiens 

<400> 155 

gatcattgtt aattagtgac atagtaacat ctgtagcagc tggttagtaa acctcatgtg 60 
ggggaggtgt gggaggtttt a 81 

<210> 156 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 156 

ggcagcacaa gtgtgcaaac agctatggaa agtgaactcg gagagtctag tgccacaatc 60 

aataaaagac tctgcaaaag tacaatagaa ctttcagaaa attctttact tccagcttct 120 

tctatgttga ctggcacaca aagcttgctg caacctcatt tagagagggt tgccatcgat 180 

gctctacagt tatgttgttt gttacttccc ccaccaaatc gtagaaagct tcaactttta 240 

atgcgtatga tttcccgaat gagtcaaaat gttgatatgc ccaaacttca tgatgcaatg 3 00 

<210> 157 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 157 

ctggtgagga gtctttgcga gagcgaggag cagcggttac tggaacaggt gcatggcgaa 60 

gaggagcggg cccaccagag catcctgaca cagcgggtgc actgggccga ggcgctgcag 120 

aaacttgaca ccatccgcac tggcctggtg ggcatgctta ctcacctgga tgacctccag 180 

ctgattcaga aggagcaaga gattttcgag aggaccgaag aagcagaggg cattttggat 24 0 

ccccaggagt cggaaatgtt aaactttaat gagaagtgca ctcggagccc actactgacc 300 

<210> 158 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 158 

cgacagctct ccaatactca ggttaatgct gaaaaatcat ccaagacagt tattgcaaga 60 

gtttaatttt tgaaaactgg ctactgctct gtgtttacag acgtgtgcag ttgtaggcat 120 

gtagctacag gacattttta agggcccagg atcgtttttt cccagggcaa gcagaagaga 180 

aaatgttgta tatgtctttt acccggcaca ttccccttgc ctaaatacaa gggctggagt 240 

ctgcacggga cctattagag tattttccac aatgatgatg atttcagcag ggatgacgtc 300 
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<210> 159 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 159 

agtacccaga gttgcgagga gttttttaac tgatttagcc aggtggcaat catgagtgaa 60 

tggatgaaga aaggcccctt agaatggcaa gattacattt acaaagaggt ccgagtgaca 120 

gccagtgaga agaatgagta taaaggatgg gttttaacta cagacccagt ctctgccaat 180 

attgtccttg tgaacttcct tgaagatggc agcatgtctg tgaccggaat tatgggacat 240 

gctgtgcaga ctgttgaaac tatgaatgaa ggggaccata gagtgaggga gaagctgatg 300 

<210> 160 
<211> 294 
<212> DNA 
<213> Homo sapiens 

<400> 160 

ctttgagcta ggataaaaat tgggtaaagg acatttgctt acctgcaaat gaatcactgt 60 

ggaaatgtga tct toccata tcatcaagaa acttgttttc tggatgaata ctgggagaat 12 0 

aaaatgagaa ctctggagtg agctaaattg atcccaatta agtttttctg cttagcagac 180 

agaaggtata attttttgac accctttccc acctggtgcc tatgctaggc ttgtnctgat 24 0 

aacatccctc actnactnga tnntcacatn gnncttncnc tgangtccca tttt 294 

<210> 161 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 161 

cttcctcaaa gcatggttgc tgagtaccca gagttgcgag gagtttttta actgatttag 60. 

ccaggtggca atcatgagtg aatggatgaa gaaaggcccc ttagaatggc aagattacat 120 

ttacaaagag gtccgagtga cagccagtga gaagaatgag tataaaggat gggttttaac 180 

tacagaccca gtctctgcca atattgtcct tgtgaacttc cttgaagatg gcagcatgtc 24 0 

tgtgaccgga attatgggac atgctgtgca gactgttgaa actatgaatg aaggggacca 300 

<210> 162 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 162 

gccccgtgtg gggagacgga cagcaccctc ctcatctggc aggtgccctt gatgctatag 60 

cgcctcccct ctcccctcag agggcacagc tgcaggcctg accaaggcca cgcccggctc 120 

tcgtgctcta ggacctgcac gggacttgtg gatgggcctg gactctccag aaactacttg 180 

ggccagagca aaggaaaacc tcttgtttta aaaaaatttt tttcagagtg ttttggggag 240 

gagttttagg gcttggggag agggaggaca catctggagg aaatggcctt ctttttaaaa 300 

<210> 163 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 163 



gaccggctgg gcctacaaaa agatcgagct 
ggagggcaaa aaggctcggg tgatggccac 
cagccttaag gtctgcagtt ccaccctgcc 
ggtacgagtg tatgacctga cacaatatga 
aacagatggc ctgtgggatg tcactactga 



ggaggatctc aggtttcctc 
cattggggtg acccgaggct 
catcaagccc tttctctcct 
gcactgccca gatgatgtgc 
ctgtgaggta tctgccactg 



tggtctgtgg 
tgggagacca 
gcttccctga 
tagtcctggg 
tggacagggt 



60 
120 
180 
240 
300 



<210> 164 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 164 

aaaatttata ngtaatgaca aatgacttat cagtgttcat catctgaaag ctaagtggtt 60 
cgttcaatca ctttttcaaa gttgatagta gattgcatgg tttcatgttt cctcatattg 120 
gtttattaat tctatttaat caaggaaaat aacttcagat tccataaagt ttcagtttat 18 0 

ttttagttta ctactaggtg agatagcaca ttacatactt ttactatcaa atattatttt 240 
agcagcttcc catagtacca aatgatttga ttccctactc tcatttttta aagcatataa 3 00 

<210> 165 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 165 

ctggactctg agtcgtcttg gtcccaggag ccagtagtga aggcaacagt ctgcccacct 60 
gtggacacca gatcctggga gctcctggtt agcaagtgag atctctggga tgtcagtgag 120 
gctggttgaa gaccagaggt aaactgcaga ggtcaccacc cccaccatgt cccaggtgat 180 
gtccagccca ctgctggcag gaggccatgc tgtcagcttg gcgccttgtg atgagcccag 240 
gaggaccctg cacccagcac ccagccccag cctgccaccc cagtgttctt actacaccac 3 00 

<210> 166 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 166 

cttctgttga ttggtttgtt taaagtacct aagtactacc ctttgactcc ctaccaaaag 60 
ttcttttgtt ttttaaacaa cttttatttg tgacttactt tcttgagaag tgttcttaat 120 
gaattgcata aaatagtggt agcagcttat ttcttaagta ctttattatt tgtgctttac 180 
catttcaggt tcttatcttt aacccttatt tactcagttt tccatctgaa tgatcctatc 240 
tctaaattaa ggatttaata aatgctgcaa attgtccact ttgcaaattg tccaaaagct 300 

<210> 167 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 167 
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gcgagatgaa gctacactgt 
ttaggaaccg gggcaagggc 
gtcagccgcc ggtccgagcc 
gctatgagct aagggagaac 
ccactgttcg gttaaaggag 



gaggtggagg 
gtccgagccg 
ttcctgctca 
attgagcaat 
cctcctgtgg 



tgatcagccg gcacttgccc gccttggggc 
tgttgagcct ctgtcagcag acttccagga 
tctccaccct gaaggacaag cgcgggaccc 
tcttcaccaa atttgtagat gaggggaaag 
atatctgtct aagtaaggat tccatatggc 



60 
120 
180 
240 
300 



<210> 168 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 168 

gtctgggcag cctacgcttt ccggataaaa atggcagaat gaaagaatta tgagtggaac 60 

tagagaatag gaaagacatg aaccaacgcc caaaatgaga aagaaggaca tataaagaaa 120 

aagacaaata caagtgaaaa aaatatacta atggattaac gtccctgtcg agtgacattt 180 

tctgactatg gaaatgatat tagacaaaaa gcaacttcaa gtgggtttct tatttgagtt 240 

caaaatgggt cataacgcag catagataac ttgaaacatg aacagcgcat ttggcccagg 300 

<210> 169 
<211> 296 
<212> DNA 
<213> Homo sapiens 

<400> 169 

gagatctctg ggatgtcagt gaggctggtt gaagaccaga ggtaaactgc ggaggtcacc 60 

accctcacca tgtcccaggt gatgtccagc ccactgctgg caggaggcca tgctgtcagc 120 

ttggcgcctt gtgatgagcc caggaggacc ctgcacccag cacccagccc cagcctgcca 180 

ccccagtgtt cttactacac cacggaaggc tggggagccc aagccctgat ggccccgtgc 240 

cctncattgg gncccctggc tanttcancn agncccncag gtngagncca aagcca 296 

<210> 170 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 170 

gggtgttgga gcagattgta gttgatccac agcaaagagc atcaccaaag ccattccagg 60 

aggaactaga tccaccactt cctctgctgg gcatgctcca aaaatggttg tggcttccag 120 

agaggactcc aaaagaaagc acaaaaacta gacagtggga gggcataccc aaaagccctg 180 

agtttctgaa aaaatattga aagtttctat ggtgaaatag gaagttaatg tgcttaggaa 240 

gaaaaaagtg gtaatgattc aaggaaacat aatcacacac ggttttagtt ttaatggaca 300 

<210> 171 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 171 

atggaggcac cagcaggtag tggcccctgt aagcagggcc agagtcggga caaagagcag 60 

gagtgaagca gccaagagac agaggaccag gctggagcca gtgggcacgc aggagcctgc 120 

ctgggaaaag ccggggggca aggctggcat gggaatgaac acctgctggt gacacctctc 180 
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tgagcttcag ttcccttaac tagaaaaata gaacaggccc ggtgcggtgg ctcatacctg 
taatcccagc actttgggag gctgaggcgg gtggatcatg aggtcaggag atcaagacca 



<210> 172 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 172 

ggcggaggag cagaagctca agctggagcg gctcatgaag aacccggaca aagcagttcc 
aattccagag aaaatgagtg aatgggcacc tcgacctccc ccagaatttg tccgagatgt 
catgggttca agtgctgggg ccggcagtgg agagttccac gtgtacagac atctgcgccg 
gagagaatat cagcgacagg actacatgga tgccatggct gagaagcaaa aattggatgc 
agagtttcag aaaagactgg aaaagaataa aattgctgca gaggagcaga ccgcaaagcg 



<210> 173 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 173 

gtctttccca ttcacttctc tagaaagctg ccaagacaga ggcagaaaga aatggatgat 
agttctgtca agcacacttc tgttctctta gaacttagaa gtgtttctaa gagaacagaa 
gtaataagag aaacagttac gtgtggaatt caacatcttt ggttggaacg cattggcttt 
ttttttcttg ttttgataga aatggaatta agcaaaagta gtttttgtct tttctgttgt 
cttcaaattt caggccatct atttttaatt taatcccgtt caagtacttg attgttatac 



<210> 174 
<211> 300 
<212> DNA 
<213> Homo sapiens 



240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



<400> 174 

attatttcca aagcagccta cagtagaaaa tagtcattat ggcagcagct tctgatgttt 
ttgtttggta ggttttctga tttcaatata tagaatcata ttcatagagt atcttctttt 
aacgaattgc acaaagtacc catttaaaat ttacatgcac agttcattgc cacctttctt 180 
aggcctatgc atagttaata aggttataat ctactcaaca tggaaaatgg agcctatttg 
caaacacaca agtaattaaa gtaccaattc tctcttagtt tcttttttta tagttggttt 



60 
12 0 



240 
300 



<210> 175 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 175 

tgganactct ttantatgga aggtgaattt cctgtcaaca tagtccagga caaagcagtt 
ccaattccag agaaaatgag tgaatgggca cctcgacctc ccccagaatt tgtccgagat 
gtcatgggtt caagtgctgg ggccggcagt ggagagttcc acgtgtacag acatctgcgc 
cggagagaat atcagcgaca ggactacatg gatgccatgg ctgagaagca aaaattggat 
gcagagtttc agaaaagact ggaaaagaat aaaattgctg cagaggagca gaccgcaaag 



60 
120 
180 
240 
300 
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<210> 176 
<211> 300 
<il2> DNA 
<213> Homo sapiens 



<400> 176 

tataaacttt attttattct cttctggggt agagttacat gacaagaaat tgaattaatt 60 

caataaaatt ttagttcggg ttgcttaggt ttttactgct cccattcttg cttttactaa 120 

tttatccaag attagatgtg attactattt aataataatt tagtcctcac acttacaaac 180 

cacttacaat accagcatgc ttctatcact gtaattctat tcaattctca ggcccatgag 240 

gcatgccagc cagacgacca gacagcattt atagagaggg cactcaatac cagccacaaa 300 



<210> 177 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 178 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 179 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 180 
<211> 300 
<212> DNA 



<400> 177 

gactggagaa gtcagaagta gaaaagcaga ttgctaggag agacaggatg acagattttg 60 

gtcagaaaat gggatattgg agtttaaagt atcaaataca gaatagttcc agatgttcag 120 

agatccagca tgggattagg tactgaaatg gattagaact aaaagtcact agaatttaga 180 

aattgagaac catgagagtg gatgcaatga cttgttgctt gattgaaaaa taaattaata 24 0 

ataataaagg accatgagac tagcctgtta tagggggtat ctccatgann nttgtttttc 300 



<400> 178 

tcctggtgtc aaacactata aacctttgac cagctgagct gtgactgctg tcacatatct 60 

gagtcctgtg tgcacagtaa tatcctgggt caggtaaaat ccaggtcttc aagttttaag 120 

gattttttga agaattcggg cttctttaag acgatccatg cccaaatcca caagcttgtt 180 

gacagtggat tacagtttgt gtggcaaagt ccaagttgtt acactgtgct ttaaaaaaaa 240 
tcttatctgc atgtattgtt aacttagaga ccatgagatc tatttatcag gaccaggaag 



300 



<400> 179 

ctcatgcctg taatcccagc actttgggaa gcagaggtgg caggatcatt ccagcccagg 60 

agttcaagac cagcctgggc aacacagtga gtgagaccct gtctctattt aagaaaaaat 120 

aattaagaaa ttttattaaa aaagaagaat caggaaacca agtccaaccc aactaaacct 180 

caaatgaacc agcccctaac acagatgagg ggatttggga ctgataagct ctgtgctgtg 24 0 

tccatggccc gtcatttatc aaggctgcag ctttgtaaat gtggctattt ttatgttgtg 300 
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<213> Homo sapiens 
<400> 180 

gtgatctgcc tgccttggtc tcccaaagtg ctgggaatac aggcatgagc caccgcactc 60 

ggccaggagc tagttttatc agcatcctgc tccactgcct tcctctagtg cagcctggaa 120 

gacatggcag cgggtagctc ctggggctga gccagaagca tcactgcagt gaaagtctct 18 0 

gcttacctgt ctggctcagc ttgggcaagg gctgggccat atgtgctcag ggacgtgctt 24 0 

ctcttgtaag gcaggaggat agaagaggac caagaaggga gggagctgcc ctgtggtgca 300 

<210> 181 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 181 

cccatgccgg gatcttccca cacccgtcct cacagatcca gccccagccc cttgcttccc 60 

aggccatctc tcagcagcac ctgcaggatg cgggcacccg ggagtggagc cctcagaacg 12 0 

catccatgtc ggagtctctc tccatcccag cttccctgaa cgacgcggct ttggctcaga 180 

tgaacagtga ggtgcagctc ctgactgaaa aggccctgat ggagcttggg ggtgggaagc 240 

cgcttccgca cccccgggcg tggttcgtct ccttggatgg caggtccaac gctcacgtta 3 00 

<210> 182 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 182 

tttgcagtgt tgtcagaaac aaataataaa gccccaaaag attaactagt tgaaaaaact 60 

ggcaaaatct gtatacgtgg aaatttacca ggacagagac tgaagaataa agaaaatgag 12 0 

tttcattgcc agatcatgaa atccaaagaa actttaaaga agatgagttg tgtaaatgga 180 

actgaaggga gggaagagct gccttcgcct gggacaaaga aaacatgtgt atacacatgg 240 

gtcaagcagt gctggtctgt ggctgcctgt ccagaggaat ggaaatatcc cttgtcttta 300 

<210> 183 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 183 

cggacccatc ggagcgtaac ctggatctcc gcaggcctgg cggaggccgg ccacctggag 60 

gggcattgct tggttcgcgt ggtagcagag gagcttgaga atgttcgcat cttaccacat 120 

acagttcttt acatggctga ttcagaaact ttcattagtc tggaagagtg tcgtggccat 180 

aagagagcaa ggaaaagaac tagtatggaa acagcacttg cccttgagaa gctattcccc 240 

aaacaatgcc aagtccttgg gattgtgacc ccaggaattg tagtgactcc aatgggatca 300 

<210> 184 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 184 
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ctgttttgca gatgaggaaa ctgaggtaca gaattcttag ggaacttacc caaaatggct 60 

tttctgcact ctgccctttg gtattgtccc atgtgaattg tttaaaactt atgtgtatag 120 

tggcatgagt aggtgatttc agaaacagaa ctcacttttg ttgtttggtc ttaaaattag 180 

gaacttttct tcatctgggc ttcatttccc tgcaccttcc cagctttcta gtcatgcaag 240 

ccacatgtct ccacgtgagg ggttcattgg aaagcagcca cagagccacc ccctggctgg 300 

<210> 185 
<211> 260 
<212> DNA 

<213> Homo sapiens 
<400> 185 

attatagaga ttaatctcct ttgctcgaag tctatttaaa tattagtcac atctaaaaca 60 
tacttttaca gcaacatcta gactggtgtt tgaccaaaca actgggcatc atagctgaca 120 
cataaaatta accatcacaa ccatgttcta ggcactgttc ctcactgcct gagaagacac 180 
cgttatgttt attagggttt ttgagtttta tccacagctt ttggttatct gcaaccatgt 24 0 

ctcccacctt taacatagtt 260 

<210> 186 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 186 

gataaactct tcagtgacga atattagaaa aagttagtta tacatttgag gaaaactata 60 
aaagtaccaa taatgagtag gaaatcactt ctgcagtatt tttggagcat tttccttaag 120 
catgacataa aagccaaagg tcacaaggga aaaaactgat agatttgtct gtgatattga 180 
gagatgtatg cacatataca tacaacagtc atagtaagac accgttagac aaaaggtgat 240 
gtatgaaaaa gaggcaaaac aacaagaaga aaagattgaa aaaatgagag ctgaagacgg 300 

<210> 187 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 187 

aaaaagtaaa gcttttcatg agcacaaatc ccttgcattg tttgatgtta ctgatattcg 60 
taaaatgaat attttttgtt ttgttttgtt ttattttttt gagacaagtc ttgctttgtt 120 
gcccaggctg gagtgcaatg gcatgatctt ggctcactgc aacccctgcc ttgcgagttc 180 
aagtgattct tctgcctcag cctcctgagt agctgggatt acaggcgctc accaccacac 240 
ccagctaatt tctgtatttt tagtagacac agggttttac catgttggcc angctggtct 300 

<210> 188 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 188 

gagcattcct cctttgttaa cgaagcaaca tttacacaag atggacatta cattattagt 60 
gcatgctctg atggcactgt aaagatctgg aatatgaaga ccacagaatg ttcaaatacc 120 
tttaaatccc tgggcagcac cgcagggaca gatattaccg tcaacagtgt gattctactt 180 
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cctaaaaacc ctgagcactt tgtggtgtgc aacagatcaa acacggtggt catcatgaac 240 
atgcaggggc agattgtcag aagcttcagt tctggtaaaa gagaaggtgg ggactttgtt 300 



<210> 189 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 191 

gcggcgctga cccggccggc cccacacccg ctcttcctct tctttgccgc ggactccctt 
tcctgcctcc aagacctggt gtctcccact gtgagcccag ctgtcccaca ggcagtcccc 
atggacctag actcaccttc cccttgcctc tatgaacctc tgctgggccc agcccctgtc 
ccagctcccg acctgcactt cctgctggac tcaggcctcc agctccctgc ccagcgagcg 
gcctcagcca ccgcctcccc tttcttccgg gccctgctgt caggcagctt tgcagaagcc 



<210> 192 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 189 

ctaatatcca gaatctacaa agaactcaac aagaaaaaaa ccaaccccac aagcgggcaa 60 
aggacatgaa cagacatttc ccaaaagaag acatacaagc aacctaaaat aatctaaaat 120 
aatttttaaa aagaaaaaat gcttgacaga gttttgatag tacttagtaa aaagttatat 
ctagtggctt tttgtttgtt tgtttttgtt ttgtttttaa gaaatagtct ctgtttccca 
agctggagta cagtggcgca atcttggctc actgcaacct cgaactcctg ggctcaagcg 



180 
240 
300 



60 



<210> 190 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 190 

aaccactatg gaggcatgat tggtggccac tacactgcct gtgcacgcct gcccaatgat 

cgtagcagtc agcgcagtga cgtgggctgg cgcttgtttg atgacagcac agtgacaacg 120 

gtagacgaga gccaggttgt gacgcgttat gcctatgtac tcttctaccg ccggcggaac 180 

tctcctgtgg agaggccccc cagggcaggt cactctgagc accacccaga cctaggccct 24 0 

gcagctgagg ctgctgccag ccagggacta ggccctggcc aggcccccga ggtggcccca 300 

<210> 191 
<211> 300 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 



<400> 192 

gacagaccgt tgagaggacg tggaggcccg agagggggta tgcgcggcag aggcagaggt 60 

ggccctggga acagagtttt tgacgctttt gaccagagag gaaagcgaga atttgaaaga 120 

tatggtggga atgacaaaat agcagtcaga actgaagaca acatgggtgg atgtggagtt 180 

cgaacctggg gatcgggtaa agataccagt gatgtggagc caactgcacc gatggaggaa ^ n 
cccacagtgg tggaggagtc ccagggcacc ccggaagagg agtctccagc caaagttcct 



240 
300 
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<210> 193 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 193 

ctcaagaaag gagaagtttt tttgtatgaa attggaggaa atattgggga acgctgcctt 60 

gatgatgaca cttacatgaa ggatttatat cagcttaacc caaatgctga gtgggttata 120 

aagtcaaagc cattgtagaa gacttaacaa gctgcagata accatgtgga cttctgtcat 180 

aattcttgct gagtcaagag tgtaaataaa agaaatggca ggactcatat tattcagttg 240 

tacccaagta tttaaaaatg actctcttaa gccttaaaaa gtcatagatt tgtgctgctg 300 



<210> 194 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 194 

cagaagctta gtcatatttc aaaatgatca aatatcaaga aaaattctga gctgcataac 60 

ttgtataaag taattttcag tgattttttt catggttatg ataaaagaac tggattagca 12 0 

gaaactttta ccctgaatca agatttaatt tttctttgag ctcatcttaa ggatatcgga 180 

acatagggag caaacgatgg tgtggctgcc tcagtgcttg atttttaacg gttttgaaga 240 

gaatagttac atttcttctc ctagtaagaa ctaataaata cattaacaga aatgaattcc 300 



<210> 195 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 196 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 197 
<211> 300 
<212> DNA 



60 



<400> 195 

ctctactaaa aatacaaaaa ttagctgggc gtggtggcac acacctgtaa tcccagttac 
ttgggaggct gaggcacaag aatcgcttga acccgggagg cggaggttgc agttagccaa 120 
gatcgccctg ctgcactcca gcctgggcaa cagagggaga ctctgtctcc aaaaacaaaa 180 
acaaaaactg ttagtgaagg ttccctggga cttttgatat tttaaaaatt gatcttatga 240 
ctaagtagat aaattcattg ccataatgag gctagctccc agataaacag cgtattttct 



300 



<400> 196 

tggatactga caatggtggc aggcatttca agccttttaa attagtactt tttgtcgtct 60 

tgcttattaa aattttgtta attttagcaa agaccaattg ttgtgataaa ctggtgtttt 120 

ttggatgctt caagcacacg ttaaccaatt ttttaattcc ccttttggtt cctcccattg 180 

ttctaaaata ggactttcat attattaaaa cctcaaaaga tgatccaccc aggatgaaca 240 
aagatcacca aggggaaaga aaacattttt tatctttaca gaaaacatgt taagattata 



300 
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<213> Homo sapiens 



<400> 199 

gcaacatttg tctacaactc tactgtaaaa ttggaaatgc ttttccacag aaaaacctct 
caaaatgctg aatgcaaaag ttgggatcac agaaacattg tgcctatttt tggtctgctg 
gaaactgtat ttttacaagg taatccctgt tttcaatata gttcctgtct tgccactggc 
ggttttcttg tagcattttt ctagttctga gattgctact acccaaagta ttcatttctt 
tcttactggg gtgtcctctg tcttcacagc ctgcttctgg attgtaggtt ttttcctttc 



<210> 200 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 200 

agtagaaaaa tacaaagact gtgatccgca agttgtggaa gaaatacgcc aagcaaataa 
agtagccaaa gaagctgcta acagatggac tgataacata ttcgcaataa aatcttgggc 
caaaagaaaa tttgggtttg aagaaaataa aattgataga acttttggaa ttccagaaga 
ctttgactac atagactaaa atattccatg gtggtgaagg atgtacaagc ttgtgaatat 
gtaaatttta aactattatc taactaagtg tactgaattg tcgtttgccc tgtaactgtg 



60 
120 
180 
240 
300 



<400> 197 

atccagatgg gatacctcta aacacgaaaa gaaagaagat tccattagtg aatttttaag 
tttggctaga tcaaaagccg agccacctaa acaacagtcc agccccttag taaacaaaga 
ggaagagcat gcaccagaat catccgcaaa tcagacagtc aacaaagatg tggacgcaca 
ggctgaagga gaagggagcc gcccatccat ggacttattc agggccatct ttgccagttc 
ctcagatgaa aagtcctcat cctccgagga tgagcaaggt gacagtgaag atgatcaggc 

<210> 198 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 198 

gcaacatttg tctacaactc tactgtaaaa ttggaaatgc ttttccacag aaaaacctct 

caaaatgctg aatgcaaaag ttgggatcac agaaacattg tgcctatttt tggtctgctg 120 

gaaactgtat ttttacaagg taatccctgt tttcaatata gttcctgtct tgccactggc 180 

ggttttcttg tagcattttt ctagttctga gattgctact acccaaagta ttcatttctt 24 0 

tcttactggg gtgtcctctg tcttcacagc ctgcttctgg attgtaggtt ttttcctttc 300 

<210> 199 
<211> 300 
<212> DNA 
<213> Homo sapiens 



60 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



<210> 201 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 201 
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ttctactttg ggtccgcgcg aagcccactc acgtgtgatc tgtgttgccc ctctcggtgg 60 

tcccaggcga tccagccatg ccccctgccc ctctgcccag atgcttcagg ggcccggctt 120 

ttcaggcttg ccctcaccag cggccgtcag ccgacactca gggatgtagc taacaccact 180 

ccgccagtgc tttcagtagg aagagctgag gctgcctggg aggcccgggg cgaccggaaa 240 

agggctctct caagttctga aaagagaatc tgccaccaga tcgaatttcg acccctgagc 300 



<210> 204 
<211> 269 
<212> DNA 
<213> Homo sapiens 

<400> 204 



<210> 205 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 205 

ttctactttg ggtccgcgcg aagcccactc acgtgtgatc tgtgttgccc ctctcggtgg 
tcccaggcga tccagccatg ccccctgccc ctctgcccag atgcttcagg ggcccggctt 
ttcaggcttg ccctcaccag cggccgtcag ccgacactca gggatgtagc taacaccact 



<210> 202 
<211> 281 
<212> DNA 
<213> Homo sapiens 

<400> 202 

ggccatggga cagttgcaac agcagttaaa tggactgtca gtcagtgaag gtcatgattc 60 

tgaagatatt ttgagcaaaa gtaacctgaa cccagatgcc aaggagttta ttccaggaga 120 

gaagtactga gccgagaaag ctttgaggaa gacttgtctg tccccacatc tggggatagt 180 

aatgcccaaa atggtggagc tgaagagggg gatggggcgg gcgaggggtg cacagcggga 240 

aggggagtgg tggtctcacg atactgtgac tctgagtaac t 2S1 

<210> 203 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 203 

gccctcagcc acccccatcc ctgccccttc tgagactcac agcacccctt tccttcctct 



cctcccacct cctccctcag cccctcattc tccttgggaa tctgcagagg gctctgggac 
tcactgccgg atgtgaaatc caggcgtcag ctgtttccta ggcaagggca ggaaagtggt 
ctccagccct tgctccactc atgcctgggg gcctggggct gagtggtatc cctacctggc 240 
ctccccctgg cctctgggcc tccagcgctg ggtttgtcga gtgagagaga gagaggagct 



60 
120 
180 



300 



gcggactctc aggacgaaaa gagccaaacc tttttgggaa aatcagagga agtaactgga 
aagcaagaag atcatggtat aaaggagaaa ggggtcccag tcagcgggca ggaggcgaaa 
gagccagaga gttgggatgg gggcaggctg ggggcattgg gaagagcgag gagcagggaa 
gaggagaatg agcatcatgg gccttcaatg cccgctctga tagcccctga ggactctcct 240 
cactgtgacc tgtttcagga gcctcatat 



60 
120 
180 



269 



60 
120 
180 
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ccgccagtgc tttcagtagg aagagctgag gctgcctggg aggcccgggg cgaccggaaa 240 
agggctctct caagttctga aaagagaatc tgccaccaga tcgaatttcg acccctgagc 300 

<210> 206 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 206 

gggattacag gcatgaccca ccgcgcccag cctgtaattt cttatacttg gtattttgta 60 

cttggattat gcttctgata cgctataatt atttatgtac atgttatttt tcttcaatag 120 

actgtgaact cttcgaatgt aggactccta gagctagata ctcaattatt ttttattaaa 180 

ttgaatgact tgaaactaca gatcctttat ttaaacttcc caaatttctg ctttatctag 240 

gcaactcttt aaattctttg atctcatgta gattccaaag gctgaaataa ttgagatttt 300 

<210> 207 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 207 

tcctgaagct cggggggctg caggtcctga ggaccctggt gcaggagaag ggcacggagg 60 

tgctcgccgt gcgcgtggtc acactgctct acgacctggt cacggagaag atgttcgccg 120 

aggaggaggc tgagctgacc caggagatgt ccccagagaa gctgcagcag tatcgccagg 180 

tacacctcct gccaggcctg tgggaacagg gctggtgcga gatcacggcc cacctcctgg 24 0 

cgctgcccga gcatgatgcc cgtgagaagg tgctgcagac actgggcgtc ctcctgacca 300 

<210> 208 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 208 

attccaaagg tttcaaagaa cttggtcata aatatgataa tgagaagaca aagtatttat 60 
attaaaacag tttagtagcc ttcagttttg tgaaaatagt tttcagcaca gaaactgact 12 0 

tctttagaca aagttttaac caatgatggt gtttgcttct aggatataca ctttaaaaga 180 
actcactgtc ccagtggtgg tcattgatgg cctttagtaa attggagctg cttaatcata 240 
ttgatatcta atttotttta accacaatga attgtcctta attaccaaca gtgaagcact 300 

<210> 209 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 209 

gagacagcag cccccaggga atgaagctga tgccagagtc agacccgagg aggaagagga 60 

gccactgatg gagatgcggc tccgggatgc gcctcagcac ttctatgcag cactgctgca 120 

gctgggcctc aagtacctct ttatccttgg tattcagatt ctggcctgtg ccttggcagc 180 

ctccatcctt cgcaggcatc tcatggtctg gaaagtgttt gcccctaagt tcatatttga 240 

ggctgtgggc ttcattgtga gcagcgtggg acttctcctg ggcatagctt tggtgatgag 300 
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<210> 210 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 210 

gtaacgtgac acgtatttta cttcttttag taggcggaca cactttctta aagtggtaat 60 

acgtcatggc cctgctataa ggtagtagtt ctagaagact gtttatctaa taattcagac 120 

taaagctatt tatattgctg tgacaccacg tggaaaactt ttataattcc atcttatttc 180 

tgatgtatat gttttatttt ctctgccttc ataagaacta aaaaccaaag ttatttacgt 240 

gaaaacaaga tttttgtttg agttcattta cttgagatat gtttaaaaaa tccaccttct 300 

<210> 211 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 211 

gtccgtcagc tggtagcttt cattcgtaaa agagataaaa gagtgcaggc gcatcgaaaa 60 

cttgtggaag aacagaatgc agagaaggcg aggaaagccg aagagatgag gcggcagcag 120 

aagctaaagc aggccaaact ggtggagcag tacagagaac agagctggat gactatggcc 180 

aatttggaga aagagctcca ggagatggag gcacggtacg agaaggagtt tggagatgga 24 0 

tcggatgaaa atgaaatgga agaacatgaa ctcaaagatg aggaggatgg taaagacagt 300 

<210> 212 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 212 

gcctgctgct tcatgccgcc ggcgtcctgc tccacgtctc tgtgctgctg ggccctgcac 60 

tgtcggccct gctgcgagcc cacacgcccc tccacatggc tgccctcctc ctgcttccct 120 

ggctcatgtt gctcacaggc agagtgtctc tggcacagtt tgccttggcc ttcgtgacgg 180 

acacgtgcgt ggcgggtgcg ctgctgtgcg gggctgggct gctcttccat gggatgctgc 240 

tgctgcgggg ccagaccaca tgggagtggg ctcggggcca gcactcctat gacctgggtc 300 

<210> 213 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 213 

ggtatggttg gagtgtagga atgaatattc atgaaatgtt tcttattgct tttccttccc 60 

taattcatac aatgaatgta tttggaatac ttacatatta taaaataaac tatacctctt 120 

caagaggtat cctgttctgt aagatcagat gtttttattg caggtcaata taatactgcc 180 

agagacagaa aataccccct tatcagtccc ttagtgcctc tttctgtttg tggcatggtg 240 

agaaaaccca tgctgaaaag attgtacttt gtgatcccaa tcagagggag gagctaatct 300 

<210> 214 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 214 

ggaaagggcc ctaaaagaga tgaacaatac ccgtatcatg tggaatgaat tagaaaccct 
tgtcagagcc catatcaaca actcagagaa acatcaaaga gtcttggaat gtctgatggc 
atgcaggagc aaacccccag aagaggaaga acgaaagaaa cgaggaagaa agagggaaga 
caaagaggac aagtcagaga aagcagtgaa agattatgaa caggaaaagt cttggcaaga 
ctcagagaga ttaaaaggaa tcttagaacg tggaaaagaa gaattggctg aagctgagat 



<210> 215 
<211> 300 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 



60 
120 



240 
300 



60 
120 
180 
240 
300 



<400> 215 

atacttttta aacctttttt ggcagctcag atggtgtaaa ttttaaaatt ttgtataggt 
atttcataac aaaaatatgt atttcttttt tgttatttta tcttgaaaac ggtacatatt 
ttagtatttg tgcagaaaaa caagtcctaa agtatttgtt tttatttgta ccatccactt 180 
gtgccttact gtatcctgtg tcatgtccaa tcagttgtaa acaatggcat ctttgaacag 
tgtgatgaga ataggaatgt ggtgttttaa agcagtgttg cattttaatc agtaatctac 

<210> 216 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 216 

gcagatattt actgaaggaa tctaggttgt attttcagtg gacaatggga ataaagcatt 
tctaaagcac cgactggaga ggaaggcaac agagacaagg agagaagccg agagacatgt 
ctgcgtgctg ccacgcatct gagcgattgc tctgtgaaga gttgtacact gaacattttc 
aggggaggct gtttacccag gcaatgtcct caaacaagcc tgtgccgggg agtcctggaa 
tctgtgccag gactgtgttt ttagcccttc acctctcagc tttagcagga catgaaccag 

<210> 217 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 217 

cccccatctt cactggttat tccacttatt taaaatgtcc agaataagca aatctccata 
tagaggaagt agattagtgg ttgcttcggg atgggaggaa tgggaagatt gaggtctttc 
ttttgcagtg ataaaaatgt cctaaaattg actgtagcga tggtcacaca actctgaata 
tgcttaagac cattgaatta cacactttac gttggtgaat tgtatggtat gtaaattata 
gttcaataac atagttacaa aagataatca aaagcatgaa agcactgttg atgtggtttg 

<210> 218 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 218 



60 
120 
180 
240 
300 
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<210> 219 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 219 



<210> 220 
<211> 260 
<212> DNA 
<213> Homo sapiens 



<210> 221 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 222 
<211> 300 
<212> DNA 

<213> Homo sapiens 



60 



acggcctggt ggagcagctg tacgacctca ccctggagta cctgcacagc caggcacact 

gcatcggctt cccggagctg gtgctgcctg tggtcctgca gctgaagtcg ttcctccggg 120 

agtgcaaggt ggccaactac tgccggcagg tgcagcagct gcttgggaag gttcaggaga 180 

actcggcata catctgcagc cgccgccaga gggtttcctt cggcgtctct gagcagcagg 240 

cagtggaagc ctgggagaag ctgacccggg aagaggggac acccttgacc ttgtactaca 300 



60 



caactagaga agattggaca gcaggtcgac agagaacctg gagatgtagc tactccacca 
cggaagagaa agaagatagt ggttgaagcc ccagcaaagg aaatggagaa ggtagaggag 120 
atgccacata aaccacagaa agatgaagat ctgacacagg attatgaaga atggaaaaga 180 
aaaattttgg aaaatgctgc cagtgctcaa aaggctacag cagagtgatt tcagcttcca 240 
aactggtata cattccaaac tgatagtaca ttgccatctc caggaagact 



inn 



<400> 220 

ggtaagtcag gtgattgaat cccggaaagg ttcattgtct tcaagctcac aatactattt 60 

tgggacaaac agttgtctag tgtttggact catgaaccct gattcttgag ggtggtattt 120 

tactgctttt gtgatttggt ttcaacatat atagtctttt ctccggagtt accttaggtc 180 

agtggccagt gtttcagccc ctggaaaggg catgggctgc cactgaggtt ggtcacaggc 240 

ctctcagctc atggtgggag as 260 



<400> 221 

gggttccatc ccttccaccc aggaaatgga ggcacgactt gcagcgttgc agggcagagt 60 
tctaccttct caaacocccc agccggcaca tcacacaccg gacaccagga cccaagccca 120 
gcagacacag gatctgctaa cgcagctggc agctgaggtg gctatcgatg aaagctggaa 180 
aggaggaggc ccagtgaccc tccaggacta tcgcctccca gacagtgatg acgacgagga 
tgaggagaca gccatccaaa gagtcctgca gcagctcact gaagaagctg ccctgatgag 



240 
300 



<400> 222 

gcggtgaccc acgtgtcctg catgattgcc ctactgctgt ggagacctcg tgctgaccat 60 
ctggcagtgt tcttcgtatt ctctggcctg tggggcgtgg cagatgccgt ctggcagaca 120 
caaaacaatg ctctctacgg cgttctgttt gagaagagca aggaagctgc cttcgccaat 



180 
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taccgcctgt gggaggccct gggcttcgtc attgccttcg ggtacagcac gtttttgtgc 240 
gtgcacgtca agctctacat tctgctgggg gtcctgagcc tgaccatggt ggccgtatgg 300 

<210> 223 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 223 

gccccctctg gatcctgagc tccctgctct agacagtgat ggtgattcag atgatggcga 60 

agatggtcga ggtgatgaga aacggaaaaa taaaggcact tcggacagct cctctggcaa 120 

tgtatctgaa gggggaaagc cctcctgaca gccaggagga ctctttccag ggaagacaga 180 

aatcaaaaga caaagctgcc actccaagaa aagatggtcc caaacgttct gtactgtcca 240 

agtcagttcc tgggtacaag ccaaaggtca ttccaaatgc tatatgtgga atttgtctga 300 

<210> 224 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 224 

ctgcggccgc aggagctgtg gcggttttcc taatcctgcg aatatgggta gtgcttcgtt 60 

ccatggacgt tacgccccgg gagtctctca gtatcttggt agtggctgag tccggtgggc 120 

ataccactga gatcctgagg ctgcttggga gcttgaccaa tgcctactca cctagacatt 180 

atgtcattgc tgacactgat gaaatgagtg ccaataaaat aaattctttt gaactatgat 24 0 

cgagctgata gagaccctag taacatgtat accaaatact acattcaccg aattccaaga 300 

<210> 225 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 225 

gccccgctcc atgagcagtg actccccagc tcctcctggc accagtcccc agggctctcc 60 

tgttggtagt tcctgctttt cttcttggaa attcctcgtg gacctcgaga tctttaccct 120 

aaaatagttc tgttgaattt caccctggca atgtaaattg atagcttatc ttcacagatg 180 

ccagacaatg gacaactcac catcagtcct ctgctcacct gagacaaatg catgtctgat 240 

tgcttcctct gccctattgt ttatgtgaaa atgcagattc actgagccag actaaggcat 300 

<210> 226 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 226 

tatataacaa cttttgcttt caaagttggg tgggactaga acacacaatg gaaggatgga 60 

gtcaggagac ctggattctt gtgcccgctc tggcttttac agtctgccta actctatgca 120 

gtcacttcct gccagcctgt ttccttacct acaagaggga gagacactcc ctggccagcc 180 

tagttctcag ggtgaacgaa aggtcattat cactgcatcc tctagtcatt tgcttcttcg 24 0 

ctaattaaca catcttgagc acctgcgatg ttccaggaac aggagatggc agcgtgcaag 300 



54 



BNSDOCID: <WQ 9933982A2 I > 



WO 99/33982 



PCT/US98/27610 



<210> 227 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 227 

ttgctgaaat gggcacttct gctgtgatgc tagtgttgat tttgctctca gatgaacaca 60 

atgtctcata ctaaccaaga agcaagaaaa gccccatgca ttcatttttc acttggagtg 120 

acaatgggag aggtcaggaa tcaagttcac tttcaagatc taagggagtc cactatctgt 180 

gcaattgtat ttggcttttt tttgcactgt ttcaatgctg gtaattgaaa ccattttaat 240 

atatttggtt gtattcactt tatatgtcct tccaaaaatg ttgttgtgta cataccatgc 300 

<210> 228 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 22a 

gctatgtatt gtgtcctacc atgaattcac tccatgctag ccacattggc ctgtatggct 60 

atztccttgga cacacctagg atgttcttgc ctcttagctt gcctaccttt ctctcatcat 120 

ttgggcctca gcgaggatat catctcctca gagaagcctt ctgtgaccat gctatctaaa 180 

atactccagc acttcagtca ccctttatcc cattactctg ctttttcaga aacattggtg 240 

ctccctgaaa catatttgtt tacttgctta gtgtcttttc tcccgcacta ccatgtaagc 300 

<210> 229 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 229 

gattttcgaa actcttcagc tacttgccct tttttatctg aaaccatcat accttctgaa 60 

agaaaaaagc atatcttcat tgacataaca gaagtgagat ggcccagtct tgatacagat 120 

ggtaccatga tatatatgga gagtggcatt gtgaagataa catctttaga tggtcatgca 180 

tacctctgcc tgcccagatc tcagcatgaa tttacagtac attttttgtg taaagttagc 24 0 

cagaagtcag actcatctgc agtgttgtca gaaacaaata ataaagcccc aaaagataaa 300 

<210> 230 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 230 

acttcttgtt tgcctttttt ataaggaaat gttggagagt tacatcattg ctaatgtaga 60 

aatgttaagt ggaaaaatat acagtttggt aaaataaact agattctaca tttatttgtg 120 

ggtttttttc ccctcctttc tttccacagc acttttgata tcaagcaagt ggcttccttt 180 

ttgagatatt aaaaaaaaaa agaaaaggaa aaaagtaaat gannnnnnnn nnnnnaaccc 24 0 

tttctnattn gnattngttt nagnattgng aagttgngtt aaanagtnct agntagaaat 300 

<210> 231 
<211> 300 
<212> DNA 

55 



BNSDOCID: <WO 9933982A2J_> 



WO 99/33982 PCT/US98/27610 



<213> Homo sapiens 



<400> 231 

tgattctttt tgttnttttt tttgatattg acaaaagctt anncnttncn attaaaaang 

ccactaatta gactttttan ntaaaaaang tagggggttt taaaactact ttcctactac 120 

caaaaaatca naaagtatct agctttctaa atngggaaag caagcaatgt tataaaaacn 180 

ctgaaggaat ctctttcttc gggacctttt gttaaactcg gttnaagctg taaaccttat "> An 
ttaaaataaa atttaccaca naacaggaaa tanaacctgg ggaanactcn aaatacncct 



<400> 233 

gaggaagagg cctgctccac ttgtctggga acctgggcag gaggcacaga ggaagccaag 
gcctggagct gcaggtcccc cggcatctct ctctgtcccg gcagcccagg atggcctggt 
gcccccacct gctgcagcag gagccccaag gagtgctagc tgagggtggt tgctggggtg 
gtcctcatgg acagtgaggt gtgcaagggt gcactgaggg tggtgggagg ggatcacctg 
ggttccaggc catccttgct gagcatcttt gagcctgcct tccggtggga gcagaaaagg 



<210> 234 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 235 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 235 



60 



240 
300 



60 
120 



<210> 232 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 232 

ggaagccaag gcctggagct gcaggtcccc cggcatctct ctctgtcccg gcagcccagg 
atggcctggt gcccccacct gctgcagcag gagccccaag gagtgctagc tgagggtggt 

tgctggggtg gtcctcatgg acagtgaggt gtgcaagggt gcactgaggg tggtgggagg 180 

ggatcacctg ggttccaggc catccttgct gagcatcttt gagcctgcct tccggtggga 24 0 

gcagaaaagg ccagaccctg ctgagttaga ggctgctggg atccactgtt tccacacagc 300 

<210> 233 
<211> 300 
<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 



60 
120 



<400> 234 

ggaacataat tagcttactg atttgatggt tctgtgtagt tcctgaaact cttggctctt 
gtttgccttt ctttaactct ggctccttct ccttcttctg tttgtgtatc tgtttaattc 
attgagtgag gaggacaggc agaactgtgt ctgccaagga ccggatgtac ttctttcctt 180 
gctcttggtt ttttgctcac ttttatatgt aaggtattag tacaaaccta aaggagagaa 240 
agtagaggat cagatcattg ggacttgttc tggtttcaag aaagaattaa caaattgccg 300 
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gttggctcaa gggccaccag aagcatttct ttattattat tattttttaa cctggacatg 60 

cattaaaggg tctattagct ttctttccgt ctgtctcaac agctgagatg gggccgccaa 120 

ggagtgcctt ccttttgctc cctcctagct gggagtgacg ggtgggagtg tgtgtgccca 180 

ggtgggggtg tctcctggct gggaaggagg gaaagggagg gagagttttg cgggggttgg 24 0 

cagtggagag caggctggag aggagatggc taatagctgt ttaatggaaa cctgctgggc 300 



<210> 236 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 236 

gaatcatcga aggttgagac cgtgtctagt tacatagtta taaataccca tctatgtact 60 

gatgccttct aaatgtctat ctccagtatg gtcttttcct ttaagctcta gatccattga 12 0 

caccctcacc atctctaaaa ggcatttcaa actgaacaca tctgatacag aacttttcat 180 

ttccttccca actttgccca cgccagcctg ctcctccttc acgctttcca cttagtatat 240 
gatcccacta ttcactcagt ctctgaagct taaaacctag gattcatcct tgactactgt 



<210> 237 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 238 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 239 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 237 

caggacatgg agcagtacct gtccactggc tacctgcaga ttgcagagcg gcgagagccc 60 

ataggcagca tgtcatccat ggaagtgaac gtggacatgc tggagcagat ggacctgatg 120 

gacatatcgg accaggaggc cctggacgtc ttcctgaact ctggaggaga agagaacact 180 

gtgctgtccc ccgccttagg gcctgaatcc agtacctgtc agaatgagat taccctccag 240 
gttccaaatc cctcagaatt aagagccaag ccaccttctt cttcctccac ctgcaccgac 



300 



<400> 238 

cactggctac ctgcagattg cagagcggcg agagcccata ggcagcatgt catccatgga 60 

agtgaacgtg gacatgctgg agcagatgga cctgatggac atatcggacc aggaggccct 12 0 

ggacgtcttc ctgaactctg gaggagaaga gaacactgtg ctgtcccccg ccttagggcc 180 

tgaatccagt acctgtcaga atgagattac cctccaggtt ccaaatccct cagaattaag 240 
agccaagcca ccttcttctt cctccacctg caccgactcg gccacccggg acatcagtga 



300 



<400> 239 

atttcctcca gtcctgggcc ccatccttga gggccttccc agccagccag caggagaggc 
aagaactggg ggaacacagg aacctagggg aggaggggag cgctgggcat cctcaggctg 120 
gcggccaagg cctgcccctg gaggcactag aggagggcat ctgtctgtgg gagcccagag 180 



60 
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gctgcaggga ggaggaggag ggaggtatct ggtgtgagcg ttgcccctgc gacatttggg 24 0 

accacacagg tgggcttcct tattccctga caaagcctct gtttccagct cttccgccct 300 

<210> 240 
<211> 274 
<212> DNA 
<213> Homo sapiens 

<400> 240 

catgagtgat attttggtct gggtttcctc ttaagatttt agtttgtctg aattaaggaa 60 

aaatgttttt aatatacatt cttattttgt cccacccctc cagaaataag ctggaaatct 120 

taactttttg gggggtcttt tttggtgttt taatgggccc agaactgtgg tttaaatttt 180 

tatgtatgta ttttcttttt tgtggagtat aaatttaaaa actggatttg ggacctaaaa 24 0 

tactcctcag gttgatgtat tcatgaaagt ttta 274 

<210> 241 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 241 

ctgttgcctg ccaagctcag ggcccattta tcatgcatct tcccatcctt gtctccccca 60 

actgtccctt acctgagtca caatttcgcc aaagccaaag ggattgtcct aagccaatgt 120 

tgatttatca ctcttcctgc tcaaaagccc ccaagatcac ctatcaatca cctacttgag 180 

tgcaagcttt gactctgtca cctgacattc aagtccccct ctgcccccat gccagtctta 24 0 

tcccctcccc tacatatgcc ctatgcctca gtttgccttc cctccacttt aaaaagcctc 300 

<210> 242 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 242 

ccgctggcta tgtggacgct ggggcagagc caggccggag tcgaatgatc agccaggaag 60 

agtttgccag gcagctacag ctctctgatc ctcagacggt ggctggtgcc tttggctact 120 

tccagcagga taccaagggt ttggtggact tccgagatgt ggcccttgca ctagcagctc 180 

tggatggggg caggagcctg gaagagctaa ctcgtctggc ctttgaggta atggggggtg 24 0 

gcggtggtgg ggggtgctta gtggctatgc tcaccccgct ccaggaggcc tattttggta 300 

<210> 243 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 243 

caagatctgg aggaatgcag agaggaactt gatacagatg aatatgaaga aaccaaaaag 60 

gaaactctgg agcaactaag tgaatttaat gattcactaa agaaaattat gtctggaaat 120 

atgactttgg tagatgaact aagtggaatg cagctggcta ttcaggcagc tatcagccag 180 

gcctttaaaa ccccagaggt catcagattg tttgcaaaga aacaaccagg tcagcttcgg 240 

acaaggttag cagagatgga tagagatctg atggtaggaa agctggaaag agacctgtac 300 
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<210> 244 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 244 

agtaaatttt ttatgcatat tttattgcaa taaaaaatga aaacagtttc aatctaggag 60 

gattttggat gcatctatgc cttgagaaat gaatggtttg atgtaaatgc atggtagcaa 120 

gaataaataa ttatgttaat tcatataata tgttatatat agttttaaag aaaattctat 180 

cactgtcttc ctatgggtag ggctataatg tccagttctt tcagggatta agagggtagg 24 0 

gtctgaagtt aatccttgtt tgtcgtaatg ttattaattt attcaaccaa gacttaattg 300 



<210> 245 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 245 

tagacataga aaacatacag taagaatatg gtattataat cttacgggac cactgtcaaa 60 

tacgcggtct gtctttgaaa agttgtaatg cggcgcatga ctataaatac ctagctggtt 120 

agcatttaca ttccttgcca gggagtttga aatttatact atagaaataa ctttaggttt 180 

taggtagagt taaagaggta aagcacatgt tgccacaacc caggaaagta tttttaagaa 240 

agattggatt ttcctacctt tagagatcta aaaaaaattt aatataaaaa atcattttgt 300 



<210> 246 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 246 

tggaatattt gctgtgaagg gagaaaggga gagaaaactc ttctgaggat catttgtctt 60 

ggtagtatag taaaaccaac cagctgaacc tttcaggcta caagagaacc cgggtcggta 12 0 

atgtcttttt aagaataatt tttaattgct tataacaagc atattttgtg gcatttgaac 180 

tatatttact gctccaatat ccgttatttt ccaaaggatt ttgtatcttt ttgaaaatgt 240 
ttacatcatc agatgatcca cagaattcac tttatgtgag atctcccgag agtttccatc 



<210> 247 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 248 
<211> 300 
<212> DNA 



300 



<400> 247 

gtgttgctca gtgagcagac ccgactccag aaggacatca gtgaatgggc aaataggttt 60 

gaagactgtc agaaagaaga ggagacaaaa caacaacaac ttcaagtgct tcagaatgag 120 

attgaagaaa acaagctcaa actagtccaa caagaaatga tgtttcagag actccagaaa 180 

gagagagaaa gtgaagaaag caaattagaa accagtaaag tgacactgaa ggagcaacag 24 0 

caccagctgg aaaaggaatt aacagaccag aaaagcaaac tggaccaagt gctctcaaag 300 
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<213> Homo sapiens 



<400> 248 

gagaggatca cttgagctta ggagttcaaa tccagcctga gccaacataa caagactttg 60 
tctctaaaca aaacagttat tgtttaaaga atctgaaatc ttcatcttta attcaggtag 120 
caatgaatcg agcccaagtt tgtttgatat ccagttccaa gtctggagag aggcatcttt 
atcttattaa agtatcgaga gacaaaatat cagacagcaa tgaccaagag tcagcaaatt 
gtgatgcaaa agggctatca aagggaggct ttttacagag aactaaggaa gagaaggagg 



<210> 249 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 25X 

gaaggcagaa gtgtaaatga acatacagaa gaaggagaaa gcctgctgtg tttggcttgt 
tcagcagggt attatgaatt agcacaagta ttgcttgcta tgcatgctaa tgttgaagat 
cgagggaata aaggagacat aactcccctg atggcagctt ccagtggagg ttacttagat 
attgtgaaat tattacttct tcatgatgct gatgtcaact cccagtctgc aacaggaaac 
actgcgctaa cttatgcatg tgctggagga tttgttgaca ttgttaaagt gctccttaat 



180 
240 
300 



<400> 249 

ctagcctggg caatatagta cgaccctgtc tttactaaaa atgcaaaaat taaccacgta 60 
tggtggctca cacctgtagt cctggctact gaggaggctg atgcaggaga atcatttgaa 1 ™ 

cccaggaggt caaggctgca gtgagctatg attgcaccac tgcaatccag cctggacaac 
acagtgagac cctgcctcac aaaaattata ttctgatttt ctgagtccat gaacacattg 
tccaaatgga tttttctagc tcctccaagt tacagatagt tccacgcaca cacagaactc 300 



120 
180 
240 



<210> 250 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 250 

aggaaggtgg aggggcagga acaggacgga caggccccgg gctctggcac atcctgggga 
acaagggacc acaaggacgg gggcagtctc cagacttccc ctgggcgctt gaccccaggc 
cttgcagggg agagagccag ggcctccctc aggtctttgt tcatgctgtt ttccctgccg 180 
tggacaccct ttcccgctct ccgattctct aaatcctgcc ccatctccca gatcttgttc *> An 
atgtccaagc ttttccagga agtcttagca gctcccacac cgcagagctc gagatgtctc 

<210> 251 
<211> 300 
<212> DNA 
<213> Homo sapiens 



60 
120 



240 
300 



60 
120 
180 
240 
300 



<210> 252 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 252 



60 
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<210> 253 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 254 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 256 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 256 



60 
120 



gcacttctct ctcactggaa agagaactgt tctcctttct ctttcttctg cctattaagc 
ctctgctcct aaactcctca tgtgtgtctg tgtcctaaat tttcctggca tggcaggaca 
aaccccgggt atttaccaca gacaacaaaa ccgcttcact atgatgtatg catgctgcaa 180 
aggaagagac agaatcttgc tctatcaccc agctggagtg cagtggcacc attgcagctt 24 0 

actgcagcct caaactcctg gctcaaggga tccttcagct tcagcctcct ggttaactag 



300 



<400> 253 

gtctgatgca ggagaattgc taaaacccag gagggagagg ttacattgag ccgagattgc 

gccactgcac tctagcctgg gcgacagagc aagactccgt ctcgaaagaa agaaagagaa 

aggaaattcc ccagggaagt acctcggctt atttcataaa caggtactga aggaagcaga 180 

ggcatgtgga ggacttcccc acctcgtgca gctatttggg ccgtggcatc tgaaatttct 24 0 

tatttcagag tcaccccttt gatcraccttcr ocaatoapph nr****-***-*- _ 

— — — — ~« — — — 3 »-wi.5ivjyv 1 ,v;u juu 



60 
120 



<400> 254 

atgttacaga catgaaatat gaacagaatg ctaaaagaac ataaaagaat aagagctcct 60 
taaagattat aaataaatgg tgatgttaaa gtaatagcac cattggacga agctagggaa 120 
tcaacacttg acagaaagat acatattttt tttatacaaa ctacatatat ttgagcaatc 



180 



aagtagtaga catagagaat tttcttttta tggaagtact ctaataagta aagggctgat 24 0 

agaattatat cagcattttc tagctcctgg ggaattatgc attgggcatc catggctgct 300 

<210> 255 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 255 

gctgcctgtg gcatagccac tgctgtacgt ttttggttgt tnttaagaaa ctcgatgaag 
aggggtgtca ttctgggctc ggggtggttg ccaatttttc accagaaagg gagccacccc 
ttgcaaccac ttctgtctcc gttagccccc cctctgccct cctccaagcc aaagcgtggc 180 
ctggcttttg tcttcccatt tagttttcct cttttaccct tccttttgtg cttaatttat 24 0 

taaaatagtt gctgtataat ttattttcat aaactataaa aaaatactaa atggttaaaa 



60 
120 



300 



60 



acagtctcgg gtttcatatt ttgctgtttt tgatggacat ggaggaattc gagcctcaaa 
atttgctgca cagaatttgc atcaaaactt aatcagaaaa tttcctaaag gagatgtaat 120 
cagtgtagag aaaaccgtga agagatgcct tttggacact ttcaagcata ctgatgaaga 180 
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gttccttaaa caagcttcca gccagaagcc tgcctggaaa gatgggtcca ctgccacgtg 
^ gttctggct gtagacaaca ttctttatat tgccaacctc ggagatagtc gggcaatctt 



<210> 257 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 257 

atagaactag gcactgattt gtttatattt atcctgctcg agacacatga tgtttcatgt 
atc tgtggct ttttatagtt taaaataatt tctggaaaag tcatagtcat tatctcttta 
accgctccct ctcttccatt ctctttgttc tctcttcctc gaactcctgt tagtcatttg 
atcctccata tctctgaata tttttgtatt tcttttatta tttatttctt gtctctgcta 
cattttacat tgagtaaaag tgggatgtga cagtgggaaa tcattagtga cttagaaatt 



<210> 258 
<211> 285 
<212> DNA 
<213> Homo sapiens 



<400> 258 

tactctatta tattgtgcat gctcctgatt tagctgctct tggcatcatt ggtcgcagtg 
gaaccttgaa atgcatctgg ctagatttat gctcaaatca ttctcagtta gccttttagt 
gcctcttcaa aggttttttt ttgtatgttt tctattctta ataaaagctt aggattaatt 
agaaagaatc tgatatggtt atgtttcccc ttgtgtacgc tgacctcatt catacgtttt 
tcatagtcca gtggtctaaa cgctttcaag agcccagctc cttgg 



<210> 259 
<211> 300 
<212> DNA 
<213> Homo sapiens 



gccttctctg gcctcaccaa ttaggtcaaa tgttccttag aatgtgttgt ggggcatggt 
ctctccctgt gaggacctgt ccagctggac ctccgccttc ctgcgactgt attggtgtct 
ttccctctca agcctatgag ctctgcaagg gcagggaccc tgtatgattt tgcctatcgt 
atgtcctcca gcccccagca cagcgcctgg tgtccagtga gagctcagca aatactttgt 
gagttaagga caggcggctg ggtagatgga tcgtctgcct agacagggca gttattcgct 



<210> 260 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 260 

gaaaagggag ccgcgcagcg cctacgggag tccggcggca gcagccggta ccggcaacca 
cgggcagctc tcagggaatc tccgtcgtga ggccagaggc tccagtcccc gcgagtccag 
atgcctgtcc agcctccaag caaagacaca gaagagatgg aagcagaggg tgattctgct 
gctgagatga atggggagga ggaagagagt gaggaggagc ggagcggcag ccagacagag 
tcagaagagg agagctccga gatggatgat gaggactatg agcgacgccg cagcgagtgt 



240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
285 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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<210> 261 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 261 



tttgctttca gtggttggct ttcactgaaa gaaagtgtaa aaaaagtcag aatttatagc 
tttcactatg tccaagacta ggactgggtt ataaagattt tcttttgtga aggaaaataa 
aagaaaattt gccactactg catttacttt actattgtaa acttaagatt cattccttag 
tctttggaat tttgatgtct caaaaccaga tgagtggaag tgctgaattt gcaaaataaa 
gctaagaatg cttaactctg cactttaagt tctactctga ccaaattgaa gatgagcaga 



<210> 262 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 262 

ttttttaaga gataaggtct tgctatgtta tctaggctgg cctaaacttc tgggctgaag 
tgatcctcct gtgtagctgg gactacaagc atgtgccacc aatgcctggc ttctcacact 
gttttgtaac atagatatgt gaagatgtgt attatagaat tgtttgtaat actgtagtgt 
tgtaggcaat gtgactgtct atagggaagt ggacaggtta tttgtggtaa atactcatgg 
aaaacggtca agcagttaaa agcaatcaat tatggtcacc cagcaatgca gataaatctt 



<210> 263 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 263 

agaacaggga gaagagagga agagggagct gcaggtgcca gaagagaaca gggcggactc 
tcaggacgaa aagagtcaaa cctttttggg aaaatcagag gaagtaactg gaaagcaaga 
agatcatggt ataaaggaga aaggggtccc agtcagcggg caggaggcga aagagccaga 
gagttgggat gggggcaggc tgggggcagt gggaagagcg aggagcaggg aagaggagaa 
tgagcatcat gggccttcaa tgcccgctct gatagcccct gaggactctc ctcactgtga 



<210> 264 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 264 

ttaaaggtag ttttagaagg aagtacaaat tggctttcat cttgcaaaca atcgtttttt 
acttcattat cttaatttgc tttgtcactc ataaaaagga aaccatacct gagttgtaga 
caatgaggaa acacttgagg cttctgctgt gtgttctttt gttattgttg ttattgttgt 
tactcagtaa cttgaatatt gtttaatgtg ttgtaagacg tagagtttat ctcaagctgt 
taaaaatggt aatgtacaaa tgtgaataga cacttatcta tataatatgg gtaagttttg 



<210> 265 
<211> 300 
<212> DNA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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<213> Homo sapiens 



<400> 265 

caggaaagtc ttcctagagg taatttttaa gctgattgtt ttagaattag tagaagcttg 60 

ccagatggaa aagtccaggc aaagtgtaac atgaatggga aaggccacag tctagaaatg 120 

gcagagtgtg ttcctagttt gtttgtttgt ttgtttgtac ctgccttgtt ccaggaagga 180 

tttaatgtgg tttatattcc agtcctttaa tgctggaagg gctgagatga gactgaaaga 24 0 

tgggcaggaa gtatatcatc acaagctttg tgtttgatgt taatgtgtat gatttttata 300 

<210> 266 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 266 

tgtgccacca cacccagctc attattatta ttattattat tattattttg agacgaagtt 60 

tcactcttat cccccaggct ggagtgcaat ggtgcgatac tggctcactg caacctctgc 120 

ctcctgggtt caagcggttc tcctgccttg gcaggcacct gtagtgtcag ctactcgaag 180 

gctgaggtgg gagaatcgct tgaacctggg gggcggagat tgcaatggtg tggtctcggc 240 

tcactgcact cgagcctggc gacagagcaa gactctgtct caaaaaaaaa aaaaaaaaan 300 

<210> 267 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 267 

atataactct ggaggtcagg acataggaga tattgattca ggacttgcca gagtatggtc 60 

ttggggtgtg ccctgatatt acaaacaggg atcttagtgg ctaggtgatg aggccatggc 12 0 

aaatgtagat ggaccaagat caatttgcct ttctagatga ggttttctag gtgaaatgtt 180 

tttgaaacta ttttgtagcc tagtataatt tataaaagta gagagaaact ataaatataa 240 

atttggaagg ggttagctaa aaggagaaaa cagcagaatc ttcatatata tagaaatgga 300 

<210> 268 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 268 

cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgtgt 60 

gtgttgcatt acttttcatt gggattgaac tgagaaataa caaacaagct ttaagtggga 120 

aattaaaaaa aagaagtaac ctatgtagat ccaaacttaa aatgtgagaa attattgaaa 180 

tttcattttc tacaaacttg aaattagcct gctaattgta aagttgtttt aataatgctg 240 

acaaatgtca gttacgtttg caaaggagtg tatggttcta ggtatttgcc tactgttacc 300 

<210> 269 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 269 
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<210> 270 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 271 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 272 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 273 
<211> 300 
<212> DNA 
<213> Homo sapiens 



65 



60 



cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgttg 

tgttgcatta cttttcattg ggattgaact gagaaataac aaacaagctt taagtgggaa 120 

attaaaaaaa agaagtaacc tatgtagatc caaacttaaa atgtgagaaa ttattgaaat 180 

ttcattttct acaaacttga aattagcctg ctaattgtaa agttgtttta ataatgctga 240 

caaatgtcag ttacgtttgc aaaggagtgt atggttctag gtatttgcct actgttaacc 300 



<400> 270 

cctacttatt ggatgttggc tctttggtgt catggagatg gctttactgt aggtttgttg 60 

tgttgcatta cttttcattg ggattgaact gagaaataac aaacaagctt taagtgggaa 120 

attaaaaaaa agaagtaacc tatgtagatc caaacttaaa atgtgagaaa ttattgaaat 180 

ttcattttct acaaacttga aattagcctg ctaattgtaa agttgtttta ataatgctga 24 0 

caaatgtcag ttacgtttgc aaaggagtgt atggttctag gtatttgcct actgttaacc 300 



<400> 271 

ccacatttaa gtgagatatg ggaaggagga gcagattgtt tttgaaggga ggaagagcag 60 

ttacttaggg tcaaattaag ttgtaaaatc ccccccggga ttttgtatgt aagtcaaagt 120 

gaattgtatt tggaagaaga actggggagc ccacctctgg tatttttttt atgtccctca 180 

tatggacaaa taaacctctg gtattaaatg aattttcttt tgggggattc tatatattcg 240 

ggatttcaac caccaaccta tctggttttt cccgctgaaa tgttgggtga tggaatcagg 300 



<400> 272 

gaacgcttcc attttatacc tgtgtctagt tagtttctgc ctatctatcc aagaagcttt 60 

tatcaagggt ccaccatgtg ccagccactg aagtagatat aaatacaagg atgtgtaagg 120 

tatggatgat ggtatacgaa ctgtcatctt actggatttg tccgctctgt taaagatacg 180 

gttccgaaaa ctttttaaag ccctagagag ggctttaagg caatgtagca tcatatatag 240 

aggcatcaac ctgttcatat ctttctattt aacagaactg tgcacctggg cacaagggtg 300 



<400> 273 

gaatggcgtg aacccgggag gcagatggtc ttaaagtggg gagacccggg ttacaggcct 60 

gactgcatca ctaactcgct gtgtgtccct gggcaagtca gtgcagtgca gtagcctctc 120 

cgtctccgac tgaggagcaa agccctcggc tcaagatcct cacctacttc acagggattt 180 
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gaaatagtgc agtcaacagg aaaagaaaag cgctatagaa atgctcgacg ctatcacttg 240 
gggcccacgt ggaagtatca acgtataaat tggcccaggc agacagaagg atgcagggga 300 

<210> 274 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 274 

ggaaccaggg gctgcagaac cagcccctcc ccaatgagga ccccctctgg acgcccctcc 60 

ccatggagaa caccaggagc cacagacccc agaccacagg agcacacagg ggagggcacg 12 0 

gggcggccgg ggcagggtgt ctgctgcctc gtttatggga tttgctccgc gtctagcaca 18 0 

ctgctgcctg cagtgctcct gtcccctgca gtggctactc tgggcctacg ggcctaatcc 24 0 

tggttggcat gaaaatgtcc tgaggctact gtgacaaatt tccacaagct gagtggctta 300 

<210> 275 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 275 

ctttgggaag cagaggtggc aggatcattc cagcccagga gttcaagacc agcctgggca 60 

acacagtgag tgagaccctg tctctattta agaaaaaata attaagaaat tttattaaaa 120 

aagaagaatc aggaaaccaa gtccaaccca actaaacctc aaatgaacca gcccctaaca 180 

cagatgaggg gatttgggac tgataagctc tgtgctgtgt ccatggcccg tcatttatca 240 

aggctgcagc tttgtaaatg tggctatttt tatgttgtgt atagtttcta tcatttattt 3 00 

<210> 276 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 276 

tttgtatttt tagtagagac agggtttctt catgttggtc aggctggtct caaactccta 60 

acctcgtgat ccgcctgcct cgacctccca aagtgctggg attacaggca tgagccacca 120 

tgcccagcca aagatcattt ttttatatag acttcagccc tttgtaaata ttgtaactgg 18 0 

ggagtataga gtagaaaaaa agtatagtta aaacatttgt tctacaaatt aacctttaaa 24 0 

aatataatta ctgctaaaaa tagagtgctg ttacacttaa ggaaaattag tgccattttg 300 

<210> 277 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 277 

ctcacacagc atgtgtcaga tccatggggt aggagtcggc cagagacttg gtaacagaca 60 

gattgctgga tcccacccct agactctctg attcagttag tttggggtaa ggcgcaagac 120 

tgaatttttc acaagtttcc cagtggtgct gatacttctg gtccaggaac ttagtgggag 180 

agaacgacta atctagacca tttcacttca cattctgagc ttcttgtaca ctgtcacact 24 0 

gcatcctttt aacaatgcat tccctatcct attgcaatac tgacatctca tcaatatttt 300 
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<210> 278 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 278 

ctgacaactt gattgggttc tccttcaggt ttgaagcgcc ctcgagaagt gtctaaagga 60 

gacagttgat agccaaacaa cagttttgga ttcactgact gattatgaaa gaagcagtag 120 

actggtatca agaatcagtc agcaaggagg ccctcaccag acgccagtgc catgttcttg 180 

gacttctcag cctccatatt catgaactaa gtttttggaa tccttaggct tccacgtgtg 24 0 

gaaagcctga gctaacctac tggaggatga gccatcacct ggagcagatt caggccatcc 300 

<210> 279 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<40U> 279 

ggtaaaccta tttatataat agaaggatga ttataaacat ttaataaatt atatcaaata 60 

gatattatat attaaatggg cagataatag aaatctgtcc aagcaaaact ctggataatt 120 

tttatgttgc cttatttttt gttttctgtg aactccaaga aaaatgagat accagtttgg 180 

aacagatgta atattgctga tttaacagtt tagggatact ccccaagttc aataattttg 240 

ccaagataca aatttaaatg gaacctttta tgaagcttca tagtgtgtga agaacttacc 300 

<210> 280 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 280 

ataactgctt gcgaagatgt agtttctctt tggaaagcta cacacacgag attatacaca 60 

tcaggcactg gaactatctg taatactgga acctctgcga agtgccaggt ataaagtttt 120 

tcccactgcc aagcatccag agctttggga aatttggaaa tcagagagat cagggcattg 180 

ttttgttcct ctgatgatga aagtgaaaag caagtactac tgaagtctgg aaatataaaa 24 0 

gctgtgcttg gcctgacaaa gaggaggcta gttagtagca gtgggaccct ttctgatcaa 3 00 

<210> 281 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 281 

caccatcgaa tatttttatt tattttgaga gacagactct gtcacccagg ctagtcttaa 60 

actgttggtg aatcttaagt gattctccca cctcagcctc ccaaagtgct gggattacag 120 

gcatgagcca ctacccttgg ctgtgatcaa gtatttagtc tgttgttaaa tgtttactaa 180 

atagtctgaa gtagagaaaa tagcacccaa tctaaaataa ggtgaggtct agtcacttat 240 

ttaaatctac attttaagct atagtttact attagtttaa actttaagac aggtaatgtt 300 

<210> 282 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 282 

gcaaccttcg cctcctgggt tcaagtgatt ctcctccctc agcatcccaa gtagctggga 60 

ctacaggcac gtgccaccac acccagctaa tttttgcatt tttagtagag gcagggtttc 12 0 

atcatgttgg ccaggctggt ctcaaactcc tgatctcaag taatctgccc actttggcct 180 

cccaaagtgc tggcattaca ggaatggagc caccgcgccc agcctgattt ctttttttag 24 0 

gtcttgtcag gaaagatatt gattcttttg attcgtgaac atggtttttg gtcgtcttta 300 



<210> 283 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 283 

cccaggtagc tgagactacc cacaccttgg tcccagctac ttgggaggct gaggtgggaa 6 0 

aatcactttg cccaggaatt caaggccgca gtgagctatg attgcaccac tgcactccag 120 

gcaacagagt gagaccctgt cttaaaaaaa gaagggagaa agtgtcagat ggtgatgagg 180 

tctggggggg aaatagagaa tggggatcag gagtgtggat ggtggtattc cctcaccaag 24 0 

aggtgacatg tgagcaggga gctgggaggt gagggtgtga cccgtgtgga aatcagggaa 300 

<210> 284 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 284 

ggtgtcctcc ccagtgcgcc gcgatttttg tgtccaagcc ccagagtccc tctgagacca 6 0 

acccccagcc agcacagact tcctgccttc ccagctcgga agcgccctcg agaagtgtct 120 

aaaggagaca gttgatagcc aaacaacagt tttggattca ctgactgatt atgaaagaag 180 

cagtagactg gtatcaagaa tcagtcaggt ttttggaatc cttaggcttc cacgtgtgga 240 

aagcctgagc taacctactg gaggatgagc catcacctgg agcagattca ggccatccta 300 

<210> 285 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 285 

aattccgttg ctgtcgggcc gccatgtcat tctggagaga gacagagtaa aacaaagaag 6 0 

gtgatgggta aagcgcagtc gcctgctata tattgtctat ttttggtttt tcacttacct 120 

tttatattta tgtcttttat gtacaacagg attataagta gcttgagtcc agtgaatata 180 

ccatttcatt ttgctatcct tcactgcact tagcttagag gaaataatca cagcttatta 240 

ttgattaatt aattaattaa tagatgaatg gtgaacacat gactatcatc ccaagaaatg 300 

<210> 286 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 286 
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agccaatgag gcttttgcct gccagcagtg gacccaagcc attcagcttt acagcaaggc 60 

tgtgcagagg gcccctcaca atgccatgct ttatggaaac cgagcagcag cctacatgaa 120 

gcgcaagtgg gatggtgacc actatgatgc cctgagggac tgcctcaagg ccatctccct 180 

aaacccatgc cacctgaagg cacactttcg cctggcccgc tgcctctttg agctcaagta 240 

tgtggctgaa gccctggagt gcctggacga cttcaaaggg aaatttccgg agcaggccca 3 00 

<210> 287 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 287 

gggtgacaga gtgaaactcg tatctccaaa caaacaaaca aaaagtcctt aaacatatgt 60 

gaacaaaaat tttgtgatgg aaggattcta gttaatgagt attgcatcaa gatttacatc 120 

tttcttacta aggaaaagag ttaataaaaa ttgttcttta ttttacaggc agttactgag 180 

gctcttccca gatctcagta aacagccact cagccttgaa aatggagtgt tgttgtttct 240 

aaacatatat ttatgtcatt tattaagtac agttcactta aataacataa gtagattttc 3 00 

<210> 288 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 288 

accactaaca gcatctactt gactactgat actttgatca tggagttagg gcatgccact 60 

tgatagaaat ttgaagagca attatatttt tcaaaaagag ttttgaataa tgttaagata 120 

gattgcaaca tgactatcaa ttcttccctt cccatcaaag gagagagtcc gtttatccag 18 0 

cctttgaatc ttgattattc aagtgacttg cttcacccaa tgtaacatta ataagcacaa 24 0 

tacaagcaga ggcttgccaa gaacttggtt tgtttctaat gcttagaaga agaatggtgt 300 

<210> 289 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 289 

tgtccttatc tgaaattcag cgatcttcat gaataagcat ttctctgatt gtggnatatg 60 
cctttaattt tatttctaga gtgacaaatt tttggttttg acagtttttt tctagcttta 120 
tagtttcttc ttggggagag aatatgtcaa cctcactcca tcatgctgaa gtaaatcttc 180 
atctcttaat tttatctctc aaaaatatcc taaggattcc ctctggagcc tgataagtaa 240 
ttgcagtatc tggtttctat ggttggatga ttcaggattc caggaataat agttactttt 300 

<210> 290 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 290 

ggaaccatga gaaccgaagc tagaattgct attgaattac tttattttct cttcccttat 60 

tgggtagaga tacatcatta ctggcctcag gggtttaccc aaagaaaggg tatttttgag 120 

caaataatgt gatttcctgg ctattttgtt gggggcttaa gatttttttt tttcaaatgc 180 
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atttttagtc actaaaaatt aactgtcgta ccatctagaa ctatactgtc cagtaccata 
gcctctagcc gtatgtagct atttgtatta agattaatgg aaattttaaa tccagttcct 



<210> 291 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 291 

tatgatttta tttttggcct aatataggaa tgtttaaaaa aggcttttct atgaaaatta 
gaaatttata cttgaaatta aaagtctaca agggggagga ccttaaagct aagctaccag 
taagacaatg aataattcag aagagaacac tattctttta ctgactgagt gcccaagatg 
ccaatttcca tgaagtcttg atttatatat atgtacacat gttatgcaca tacatgtttg 
ttttctaaca gttattcttt aagcttttga gataatttta gacttacaga agagttggaa 



<210> 292 
<211> 278 
<212> DNA 
<213> Homo sapiens 



<400> 292 

cccagaccta tggagtcaga cagtaggttt gaggcccagc aatctatggt ttaacaagcc 

atccaggtgt ttctgatgca cagtgaaatt ggggtaccac tggtattagg tttggtatgg 

caactttttc atcacttgtt ttatgtagtt gtctgatcaa ttgtgaaaac ataatgaatg 

ttggaaatgg aacagtaaaa taacgaaagc caactttttt tttttttttn rmniinnnrinn 
nntgntttnn cccccaggnt gnanngcagg gncccaat 



<210> 293 
<211> 297 
<212> DNA 

<213> Homo sapiens 



<400> 293 

ggaaggcagt gggaggagag gaccaagtct caaactccag aagccccacc tccctgagct 
cagctcctct gccaagcccc ctcagcgcga agtcctcgtc cagagaaggc aacggcgaga 
aacaaatcca acatcctggg ctgctttttc cttcccccac tttttaaaag tttggtgtcc 
aagtcacttg acaaacccag accctaacaa tgatattttg tgtagaattc tgggatcaaa 
atataatttc aaaaataata tattttctga catcccccaa aaaaaaaaaa aaaaaaa 



<210> 294 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 294 

ggaacagttt gagcaaaggc tctcaagtaa tagggtgtct gacttgttca tttttgaaag 
tagaactaat aggatttctt attggaacgt agggtgtaag agaaaagagg agtcaaaaag 
agccacaaga tttttggtct cagcaattag aaggatagaa ttgacattta ctgagatttt 
tgtttttgtt tttgagacgg agtttcgcta ttgttgccca agctggcgtg caatggcgtg 
atctcggctc agtgcaacct ccacctccca gattcaagcg attctcctgc ctcagcctcc 



240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
278 



60 
120 
180 
240 
297 



60 
120 
180 
240 
300 
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<210> 295 
<211> 299 
<212> DNA 
<213> Homo sapiens 



<400> 295 

gtaatattga tgtgattgtt gtcgcttgag aaaaaaaggc aacagctgat tctttcaaca 
actgtcacag aatggctggg ctgagaacgc tgcccagggc cctgcagctg gcgggagnnn 
nnnnnnnnnn nnnnngtgcn tgctgcaaca tntggttana tngtatcctt ccctanagnt 
gctacnnctt rmatcccctt gtnaatatgt tgagntnnct tngcnttcnn gntnntccng 
ntnnttgaca cntatgnaan ttntntngtc tngctctgct ngatnncttn nangctgcc 



<210> 296 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 296 



gcagaacctt ttcccctcta ctcttgtcta aaagttctgt gtggcacaca gagatgcgac 
ctactcaatc tgacttagta aaaccatgct gaaaaatttt ggtctaaaaa ggacccatac 
ccagcaccca tgaaataaaa gattcatctg taattgggat tcaaagggat taaattcctt 
tggtcatact cataaatagc actaaagtgt tataacattt tcatttacct atttttagtt 
ccttcatttt aacttaataa aaatcttgga ttgatattct tttttttttt ttttgggacg 



<210> 297 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 298 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 298 



<210> 299 
<211> 300 
<212> DNA 



60 
120 
180 
240 
299 



60 
120 
180 
240 
300 



<400> 297 

gctaggatta caggtgtgag ccaccatgcc cagccactta tctttaaagg attaagttta 
tgtttcctac tatgggaaac catcccaccc caaacttgat gaccgcatta tgtgctttta 
tagaacatgg cacttctcca ggatagcatt tattctgttt tgtaagtgtg aatgtaatta 
ccctacacac agcatacaca taatcttcat attctttgcc ttgtcttgtg aaggcaaggg 240 
ccatgtctat cttattcgtc attagattcc cacatccaac atagtcctgg ggacagcacc 300 



60 
120 
180 



ccaaatctgc ctagagattg agttcacagt gtatgttctg ggggcgctgg tgcagtcagc 60 
ggtccagtct ccagcctgca ggcgtgcaca ctggggtgga cgatgggtgg ccccgcaggt 120 
gtacacattt gggtggcccc ggcccctata ccccagtgtt ctctttgatc cagtcccgaa 
acagagggag ccttgtgtac acgcctccaa agtggagctg ggaggtagaa ggggaggaca 
ctggtggttc tactgaccca actgggggca aaggtttgaa gacacagcct cccccgccag 



180 
240 
300 
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<213> Homo sapiens 



<400> 299 

ctccattgtg aagatccagg catttttccg agccaggaaa gcccaagatg actacaggat 
attagtgcat gcaccccacc ctcctctcag tgtggtacgc agatttgccc atctcttgaa 
tcaaagccag caagacttct ctgctgctgt gatctgcaca ccctccaacc tgggcaggga 
ctggggggat gcagtgtgtg ttagtgccca tgtggcattg tggcactgtt gccccccatg 
gcggcatggg caagatgacc ttccattagc ttcaagtctt gttctcttgt ctgtggtctg 



<210> 300 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 300 

agcaattcca ctcctagctc cacccacagg aattgaaagc aaagacgcaa acagatgcct 
gtgcaccaaa gttcacggca gcatccttcg ccatagtggc agcatccgtc gtcacagcgg 
catcatcctt catcatagcg gcagcatccg tcgtcacagc ggcagcatcc ttcgccacag 
cggcagcatc tgtcgtcaca gcggcagcat ccttcgccaa agcggcagca tccttcgtca 
tagcggcagc atcctttgcc atagcggcaa ggtggaaacc ctgtccatcc actgaggcgt 



<210> 301 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 301 

tcacagatat gaaagttcag tcagaggggc tgggccgaca tctgtgcttt tccctgcagg 
atttttagga tcagtgagac ggtgtgtatt tggaagcatt tcaaatgtgt taccatcgtg 
ttacttccgt gggcacctgg tgttattggt tggactagtc aggattctcc agagcagcag 
aagcaatggg atgtgtgtgc atgtgtttgt gcagagacag aaagagagat tttaaggaac 
tggcttatgc agttgtgggg gctagcaagt ctgaaatttg cagggcgggc cagcaagctg 

<210> 302 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 302 

tcaccaggaa tacagtgaca ttaaaagtgt gatatggttt agctgtgccc ccacccacat 

ttcaacttga actgtatcta tctcccagaa ttcccacatg ttgtgggagg gacccagggg 

gaggtaactg aatcatgggg gctggtcttt cccgtgctat tctcgtgatg gtgaagtctc 

acgagatctg atgggtttat caggggtttc cacttttgtt tcttcatttt ctcttgccac 

cagcatgtaa gaagtgcctt tggtctccta ccatgattct gaggcctccc tagccatggg 

<210> 303 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 303 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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gccctctcca ttttctgagg aggtgatatt tgggcagatt acaaactgag gaagcatact 
ggatagacat caggatgaag agaataggca gttgaaaagt cccagaaagg ggagtgtgct 120 
tagagtgttt gaggaacagc aaggaagcaa gcccttgttg aaacagattg agcaaggtag 180 
aaagtggtaa aagatgaagt taaagaggta gctgagagcc agatcatgta aagccttggt 240 
aaggactgac ttttatttta agagggttag gaagacattg gtaggttttg actctggctt 



<210> 304 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 305 
<211> 300 
<212> DNA 
<213> Homo sapiens 



300 



<400> 304 

aacaggaata tggaaagaaa ctcagagccg agttagtgga aaagtggaaa gcagagagag 60 

aggctcggct ggcaagagga gaaaaggaag aggaggagga agaggaggaa gagatcaaca 120 

tctatgcagt caccgaggag gagtcggacg aggaaggcag ccaggagaaa ggaggggacg 180 

acagccagca gaagttcatt gctcacgtcc ctgttccctc gcagcaagag attgaggagg 240 

cactggtgcg aaggaagaaa atggaactcc tccagaagta tgcaagcgag accctgcagg 300 



<400> 305 

aatagtagaa agggtcccca ttcctgctca gcaccgcacc tctctacccc cccacagaca 60 

cacatgcaga cacacacatg cagacaacac gcagacacac acatgcaggc actcacatgc 120 

aggcccatgc acacacacgt gcacacacat gcagagacat gcagacacgc aggcacacat 180 

gcacacatgc aaagacacgc atgcaggcac acgcagacgc acacagagac acacatgcag 24 0 

atacacatgc acacacacat acacacactg gcccctgttt ttctgtggtg tcactgggtg 300 

<210> 306 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 306 

cagcaaagac tttatttttg tacagaagat ggtgaagtcc aagacggtgg ctcagtgcgt 60 

ggagtactac tacacgtgga aaaagatcat gcggctgggg cggaaacacc ggacacgcct 120 

ggcagaaatc atcgacgatt gtgtgacaag tgaagaagaa gaagagttag aggaggagga 180 

ggaggaggac ccggaagaag ataggaaatc cacaaaagaa gaagggagtg aggtgccgaa 24 0 

gtccccggag ccaccacccg tccccgtcct ggctcccacg gaggggccgc ccctgcaggc 300 

<210> 307 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 307 

gctgcttctg gctggggggt ccttggcctt catcctgctg agggtgagga ggaggaggaa 60 

gagccctgga ggagcaggag gaggagccag tggcgacggg ggattctacg atccgaaagc 120 

tcaggtgttg ggaaatgggg accccgtctt ctggacacca gtagtccctg gtcccatgga 180 
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accagatggc aaggatgagg aggaggagga ggaggannnn nnnnnnnnna ntggccttnt 240 
gtggcctcca ccagcagctn tnnannatga catggagtcc caactgnacg nctccctcat 300 



<210> 308 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 308 

agttaagagt gtgaacccta gatttgccat ctgaaagtca tgtgtccttc agtgatgcat 
ttaacctctc tgtgcctcaa atttctccct ctggggtatg ttaggagtat acaaattaac 
acatgtaaag tgcttagaat agattggtac tgttaaatat gagctaacgt cacatttgat 
atttttttaa aaagaaaaaa tcattatgga gtctcagtcc tagagattct gattcattaa 
ttctgcttct cggcaaggag cgatttgctg gtgtagacat tccgggtccg tgtaaagggt 

<210> 309 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 309 

ccaacaccca gttctcactc tgtcatccag gctggtgtgc agtggtgcaa tgtgggctta 
ctgcagcctt gacctccagg acaagtgatc tcccacctca gcctccggaa tagctgggac 
tacagctcaa caacgcccct ctgaaagtag gactcttgga aatgaacctt gttgggagta 
aagctgaacc ttcacctctc ctttccagga ttctactcca ttcatacggc ctcacactga 
attaatggtt ctagcagcca catcactttg ttacccaatt gatctagtag taaagtcttc 

<210> 310 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 310 

aggaaacacc cccttataaa accatcatat caggctgggt gatctgacag agctagacac 
tgtcaaacaa acaaacaaac aaacaaaaaa accccatcac atctcatgag acttatttac 
tatcatgaga gcagctcagg aaacacccac tcccgtgatt cagttacatc ccactgggtc 
tgtcccacaa attgtgggag ctacaattca agatgaggtt tgggtgggga cacagccaaa 
ccctatcacc atgtaaaata atatctaatt tgtagagatt aaagaacaag ataacttaaa 



<210> 311 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 311 

ttntgcagat ctccagcaca agcctctgct agttgatctc acggtagaag aaggtcaaag 
attaaaggtt atgtttggtt cacacactgg tttccatgta attgatgttg attcaggaaa 
ctcttatgat atctacatac catctcatat tcagggcaat atcactcctc atgctattgt 
catcttgcct aaaacagatg gaatggaaat gcntgtttgc tatgaggatg anggggtgna 
tgtaaacacc tatggccgga taacnaagga tgtggtgctc caatggggag aaatgcccac 
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<210> 312 
<211> 275 
<212> DNA 
<213> Homo sapiens 



<400> 312 

cctccctgga tgtgcagaca tggaggagga cagaaggccc agctcagtgg cccccgctcc 
ccacccccca cgcccgaaca gcaggggcag aggcagnnnn nnnnnntaag ngtgtnnaan 
tntnaatttn ttcctntttt ttttnnnttn aaatatnntg nnnttttttn ntantantta 
ttatnntntn nttattannn tntttttcnt ntnttacttt gttnttgatt ttanncnttt 
natntttttt ttgttcttct nttntattnn atctt 



<210> 313 
<211> 300 
<212> DNA 
<213> Homo sapiens 

^400> 313 



tcctgtcttc ttgcccaaat gttgcatttt ccaagaccac tctggcctgc catgccaccc 
attctgtgcc tataaaaacc ctgagacccc agcgggcaca cacacaagcg gctggacgtc 
aagaggaaca cactggcaga agaacacatc gaaagacgct ggcaggccat tgatggtgga 
acgattcgga cgccaaggga aattcggcca aggacagtag gagatcccgg ctgctgagca 
gccagactcc agaggaagac taccttccca tgctatcccc cttctggctc cccagccatc 



<210> 314 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 314 



ataagggtgg ggccttaatt cagtagaatt ggtggcctcc taagcagagg aagagagatt 
tttctttctc tctctgccat gtgaagacag tgaggagtcg gccgtctgca agccaagaag 
agcccttatc aggaacagac ttggctagca ccttcatcgt ggacctccag cctccagaat 
tgcaagaaaa tacatttccg tcgttgaaac cacccagtct gtggtatttt gttatggcag 
cccaggcaga ctaatacgtg aagcctgctc taaatagata aaataagaaa ttactacaga 



<210> 315 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 315 



60 
120 
180 
240 
275 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



gtctcagtgt ggcctgtggg gctggtgggc ggcctgcgat tcgagggccc tcaggtacag 60 
gacggccggg tagtgggctt ccacacagca tgggagccca gcaggccctt ccctgtggat 120 
atggctggat ttgccgtggc cctgcccttg ctgttagata agcccaatgc ccaatttgat 



180 



tccaccgctc cccggggcca cctggagagc agtcttctga gccaccttgt ggatcccaag 240 
gacctggagc cacgggctgc caactgcact cgggtactgg tgtggcatac tcggacagag 300 

<210> 316 
<211> 300 
<212> DNA 
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<213> Homo sapiens 
<400> 316 



acaggaaaaa 


ttaatataat 


60 


atttataaat 


ttatatctaa 


120 


aaattgaggg 


gattaaaaag 


180 


ttttgagggc 


attaaaaagg 


240 


ttaatgaaaa 


cacagcactt 


300 



<210> 317 
<211> 295 
<212> DNA 
<213> Homo sapiens 



acactgtccc actccatcac ccaggctgga gtccagtggt gtgatcatag ct °9 ct 9«^ 
cctccagttc ctgggttcaa gccatccctc ctgcctcagc ctccccagta gctggaacta 
caggtgtgtg ccScacacc tggctttaca tttttctgtg gggacttact *tgttgccca 
ggccggccto aaactcctga gctcaagtga tcctctgcct cagcctccag agtatctggg 
attacatatg tcggctaccg tgtctggccg ttcacatctt tggccactat ttgct 

<210> 318 
<211> 261 
<212> DNA 
<213> Homo sapiens 

cctgaatata'aagaggagga ggaagaccaa gacatacagg 9ag«aatcag tcatcctgat 
acraaaggtgg aaaaggttta taagaatggg tgccgtgtta tactgtttcc caatggaact 
cgaatggaag tgagScaga tgggaagacc atcactgtca otttetttu tggtgacgtg 
algcaggtca tgccgaccca agaannnnnn nnnnnnnnnn nnntngccnn aacnnttcac 
caaatncccc gggggggctt g 

<210> 319 
<211> 300 
<212> DNA 
<213> Homo sapiens 

gggacctctg'cccaagaaag cctgggtatt gaccaaggtt tcccccccac tgagacagcc 
tgagatatgg cctcatggga agggaaagac ctgactgtcc cccagcccga cacctgtaaa 
gggLggtgc tgaggaggaa tagtgaagga gggaggcctc tttgcagttg ^ataagagg 
aaggcttctg tctcctgctt gtccctggta atggaatgtc tcggtgtaaa 9^gaccatt 
cccattcgt? ctattctgag ataggagaaa accgccctgt ggctggaggt gagatatgct 

<210> 320 
<211> 289 
<212> DNA 
<213> Homo sapiens 

<400> 320 



60 
120 
180 
240 
295 



60 
120 
180 
240 
261 



60 
120 
180 
240 
300 
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Z12r.t, t 9gCCaa99gg ctagacctcc caggctaagc ctcagattca gtgcaggaca 
caagctcatg cccccgtctt gccagtgaca cttgaagcct occgacttcc LIgagTgct 
tcaggacaca ttttgagtgg tattttcttt tctttttttc ttcttttttt tttSSn^ 
nnnnnntngt tntgtnnccc aggctgnann gcaggggcct gatntnggnt aantg^ct 
ttngcctccn aggttaaagc nattttttng cotaancctc naaagtacc ntgnaaCC 



<210> 321 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<400> 321 

gaaagaccga gatagagaga gagacagaga cagagagcga gaccgtgatc gggacacraaa 
aagagaacgc accagagaga gagagaggga gcgtgatcac IgtcctLac cSgtgtttt 
caacagcgat gaagaacgat acagatacag ggaatatgca gaaagaggtt atgagcgtca 
cagagcaagt cgagaaaaag aagaacgaca tagagaaaga cgacacaggg agfaSggJ 
aaccagacat aagtcttctc gaagtaatag tagacgtcgc cataaaJff I^**!!!!* 



<210> 322 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 322 

cgcootttaa ctgcagttct gctctatttt cttttctctc tctggagctg agagtcaoaa 

iSctcgcc :t::t::iti cagc r ccaa cactaagctg £T~i:i 

gcccctcgcc ttcctcaacc cacctggcaa gtcttcttag gatctgatco cagttttcta 
gaagcaatcc taccccagcc caagcttccc aagagtcgag ccttaatcct tSacttc? 
cagtgtcaga gcagaaatga atcctggggt tgactgtgtc cattcgggtt attagcagct 



<210> 323 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 323 

agattatgag catgtagaag atgaaacttt tcctcctttc ccacctccag cctctccaaa 
SKcctcca f tgaaggaa ct ^-tga tgaagagtca ggaaatggag Sctgttcc 
tgtacctcca aagagaacag ttaaaagaaa tatacccaag ctggatgctc agagattaat 
ITallT 9 * 9gaCttCCag ""aaggca tgtatttgat aaggcaaaat tc^aggtaa 
aggtcatgag gctgaagact tgaagatgct aatcagacac atggagcact gggcacltag 

<210> 324 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 324 

gtctgagaag tcaaggatcg gggtgctggc ctattcagtt cctggtaagg gctgtcttcc 
a?a^aS ttgaaCtaCt tc "gctgtg tcttcacaag catgccccca S££S 
ataagaactc cagaccccaa actcagctca tacacacacg gaagagagaa gcatctgaac 



60 
120 
180 
240 
289 



60 
120 
180 
240 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
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atcaaaaaga gaagaagctg ctggacatca gaaactgtga aaggagagga gtttggctga 
gctccSggg Sgactgcc? gcacattcta tccccttttc agttccccat cctgctgtca 



<210> 325 
<211> 283 
<212> DNA 

<213> Homo sapiens 



<400> 325 

gtccgaagaa aaagactgtg gtggcggaga tgctctctcc aatggcatca agaaacacag 
aacaagtttg ccttctccta tgttttccag aaatgacttc agtatctgga gcatcctcag 
aaaatltatt ggaatggaac tatccaagat cacgatgcca gttatattta atgagcctct 
gagctScta cagcgcctaa ctgaatacat ggagcatact tacctcgtcc acaaggccag 
ttcactctct gatcctgtgg aaaggatgcn ngtgtgtagc tgc 



<210> 326 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<210> 327 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 327 ^ ^. ^ 

ggtagactgg ctagggatcc tggacccagg gttccacgta gcaacacctg ctgagttctc 
tgggttttct tcctgcctca tgtagcccag acttggagct gaagaagctg gaaacatgs* 
aacaccaaca gctacagacc aaaaaaagtc ccaacaaagg cctgtcagtc tgccagcctg 
ttctgtggat ttccaactca agattgcagc atcaactcac acctgaagtt ctggcttccc 
tacaaacttt gaacttgcca gtccccacaa tggcataagc caattcctta aaatgaatgt 



<210> 328 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 328 . 
gtcacaggca ggtttaatgg ccagtttaaa acttatgcta tctgcggggc «ttcgtagg 
atggtgagtg tttccctggg ctttgctcat cacttcggga catcgtggac tttaccgtgc 
gcattggagt gtgtgatggt gcctgagtag atctgctggc agagtagttt gagccagctg 
gactgggctg gccgcctgcc gcttcttgag ggtggaagag gggtgctctg agaagacact 
caggcagcag actctgcctc tcacttaagg tgcccccccg accccgctcc accatagtca 



240 
300 



60 
120 
180 
240 
283 



atgacatcct cattatccac actgcaaagc caaccatccc tatgatgggt ^attgtgga 60 
tcatgactta gtgggtcaag agtttggaag tggctcagct gggcggttct ^ctgctccat 120 
gtggctgcca gatggtaccc tgctggtggg cagtctggtc tagagggtcc ^^gatggctt 
Lctcacatg cctggcatct tgacagggac agctggaagg caaggttcag ctgggactgt 
ccacagagct cctccctgtg gcctttccag catggtggtc tcagggtagc tggacttcct 



180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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<210> 329 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 329 

aataS^f ^IT tctcatca ^ ^ctccatgt gtttttgatg cctttgaact 
ggtattttta aaatttcaat ttctaattgt tcattataga aacacaattg ggttttatat 
tttllT.l 9 tat " v tgCaa ct "cctaaa ctcactagta attctagtag ctttttttgg 
£S£S? a9g !" ttCt Start—tag tcatgtcatt tgtgaataaa gccatttttt 
tttccttttc aaattttgtg ccttttattt cttattctta ccatatcaca ttggcaaaga 



<210> 330 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 330 

tt aa ?S tC ^ gg f 9CtggC c ^"cagtt cctggtaagg gctgtcttcc tggcttgcag 
ttgaactact tcttgctgtg tcttcacaag catgccccca tcctgtgccg ataagaactc 
aSrT a ! tCagCtCa acacg gaagagagaa gcatctgaac atcaagaaga 

I 339339 ? Ctggacatca gaaactgtga aaggagagga gtttggctga gctccagggg 
aagactgcct gcacattcta tccccttttc agttccccat cctgctgtca gccacattS 



<210> 331 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 331 



accgccctgt ggctggaggt gagatatgct ggcagcaata ctgctctgtt actccttgct 
acactgagat gtttgggtaa agagaaacat aaatctagcc tacgtgcaca tctgggcaca 
gtacctttcc ttgaacttat tcgtgataca gattcctttg ctcacatgtt tccctgctg^ 
Saaatact " CW *? C «^acact cccctcgcta agacagtaaa aataa^gatc 
aataaatact gagggaactc agaggccagc gccggtgcgg gtccccccca tgctgagcgc 



<210> 332 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 332 

ggaaaaacaa caggtttgag tcctataaag ccataattta actccagtag ctgatgtcag 
TatlT 9 C !, tatgtCCt attt ^tgg cagcagcgcc agcccagcaa gaaggctgg^ 
ggttgtcaag gttgtcccca gaccttgctt gcagtggttg gagaacccag ggggctgcct 
Sctta^f ggC , C ! ga9gg -Scgggcag ctctagccct ggagattgtg gtcacattgg 
ggcttgttta ggattggagg gccaggtcac ctccccagcc accctccctt ctctcctctg 



<210> 333 
<211> 300 
<212> DNA 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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<213> Homo sapiens 



<400> 333 

cctcctactc ccaaacaaat ctttggggaa aaaaaaacta ccaactgtca gccatgggcc 
tgacggcgct aagctctggg gctccgtgca ctgacgtggg gccagccaca gggaggcggg 
gatcaagtag cggaggccag gattttggcc acctcccggg caagttgcag ggcagtggcg 
ccgggagcaa aagcagcatg atgcagctca tgcacctgga gtccttttat gaaaaaacct 
cctcctgggc ttatcaagga agatgacact aagccagaag actgcatacc agatgtacca 



<210> 334 
<211> 262 
<212> DNA 
<213> Homo sapiens 



<400> 334 

accatgccca tttgtttact cattgtctat ggttgctttc atgccctcac agcaaaggcg 
agtagttgtg atggatcaaa tggcccacaa agcctgaaat atttactctt tgacccttta 
cagaaaaaaa ccttgttgac ccctgcttta gagaatgaga agccatgcag ggatcagtga 
tgccagagga agggaaggaa ctgcttccag ctattgtgac aataataata ataataatat 
tgggtctttg actagaacgt gt 



<210> 335 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 335 ^ . 

tcntntctcn ntattnttgn gtagtncctc ntttccttgt ncnntxmtcn nctnttgnct 
tttgcggacc ctcgattcta tctcatatga gtgagaacgc ttaccagtgc agcgaatgtg 
ggaaagcctt ccgagggcac tcggactttt ctaggcatca gagtcaccac agcagtgaga 
ggccttatat gtgtaatgaa tgtggaaaag ccttcagcca gaactcgagc cttaaaaagc 
accaaaagtc tcacatgagt gagaagccct atgaatgcaa tgaatgtggg aaggctttta 



<210> 336 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 336 



gaggacccac tcccccagga ctcctttgaa ggcgtggacg aggacgagtg ggactagcct 
gcgcccccgt cacctccacc tcacctgtgc tgccacttcc tagtgcacac ctcacggctc 
atcctcaagc tggaagatac ctctctggcc ccggcacatg tcacccctgc actcctgcct 
tcccgtgggc acttccacat cctctgggcc tctggcagtt cccagggact gttttcacct 
ctgctgtctc tggggtcagc tgctgctcat cagctgcccg ctagcatgtg gccaggggtg 



<210> 337 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 337 

80 



60 
120 
180 
240 
300 



60 
120 
180 
240 
262 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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agacaaccca gaaacaaatt catacatcta tggtgaccac ttttgacaaa ggaatgaaga 
acatacactg gggaaaagat aatgtcttta ataaatggtg ctgggaaaac tggatatcca 
tatgcagaag aatgaaacta gacccccatc tcttagcata tacaaaaatc aaaattaatt 
aaaaagttaa atctaagacc tcaaactatg aaacagctaa aagaaaacat cggggaatct 
ctccaggaca ttggagtggg caaagatttc ttgtgtaata cctgacaaac aggcaaccaa 



<210> 338 
<211> 292 
<212> DNA 
<213> Homo sapiens 



<400> 338 

tcaataacca tgaagatgca tcctaccacc gtcagggcaa tcattagata gctgatcttc 
actcgcatct tcatggttat tgagggcaag aaggctgccc aaagacacga gactttaaca 
agcttgaact tagaaaagaa agctcgtctg aaagaggaag cagctatgaa ggccaaaaca 
gagtagcaga ggtatccgtg ttggctggat tttgaaaatc caggaattat gttataacgt 
gcttgtatta aaaaggatgt ggtacgagga tccatttcat aaaotato^ *- t 



<210> 339 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 339 



60 
120 
180 
240 
300 



60 
120 
180 
240 



60 



gaaatttgca ctgatggctc agaaggctta cgtcatggag agtatgacct acctcagagc 

agggggggct ggaccaacct ggctttcccg actgctccat cgaggcagcc atggtgaagg 120 

tgttcagctc cgaggccgcc tggcagtgtg tgagtgaggc gctgcagatc ctcgggggct 180 

tgggctacac aagggactat ccgtacgagc gcatactgcg tgacacccgc atcctcctca 240 

tcttcgaggg aaccaatgag attctccgga tgtacatcgc cctgacgggt ctgcagcatg 300 

<210> 340 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 340 

ctcagngcan cgatcatggc tcagtgcagc ctcaaactct tgggctcaan canagcgggn 
acctcaacct cctgagtagc taggactata ggcacacagc accatgcccc ggctattttt 
ttattttgta gagatggggt ctcactatgt tgcccaggct agtcttgaac tcctggcctc 
aagcaatcct cccacctcgg cctcccaaag tgctgggatt aaaggcgtga gccaccgtac 
ctggcccttg gtggaatctt tagggttttc tattcataca tataaaatca tatcattggc 

<210> 341 
<211> 296 
<212> DNA 
<213> Homo sapiens 

<400> 341 

atccaggtgt ttctgatgca cagtgaaatt ggggtaccac tggtattagg ttgggtatgg 
caactttttc atcacttgtt ttatgtagtt gtctgatcaa ttgtgaaaac ataatgaatg 
ttggaaatgg aacagtaaaa taacgaaagc caactttttt tttttttttt ttnnnnnnnn 



60 
120 
180 
240 
300 



60 
120 
180 
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nnnnnnnnnt tnncccccng ncngnanngc aggggcccaa nntnggntnn ntgnanccnc 
cnccnccggg ntnnncccct ttntcnngcc taacccnccc nagnacnngg aactac 



<210> 342 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 342 

ggcacgatca tggctcattg cagcctctaa ctccggggct caagcaatcc tcccacctca 
gcctaccaag tagctgtgac cacagctgcc cctcaccatg ctaagctaat ttttttaatt 
agatagtaca taaacgtccc aaaattagaa gataaaaaga catgagggat ccattctaat 
ttgtgtttgg agtgtaatgg tccagctcca ttcttctgca catggatatc cagttttaca 
caacactgtg aatgtaatga atgccactga atcatacact caaaaatagc taaaatggca 



<210> 343 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 343 

gttttcatca ctacatattc tacacacact gggaagctct gacaacttat tccctgctat 
tatcaactaa agatcaccct ttctactgct gtctctggag caggagctgg caaactatgg 
cctgctgtct gtttttgtac agttttactg aaacacagcc atgcccattt gtttactcat 
tgtctatggt tgctttcatg ccctcacagc aaaggcgagt agttgtgatg gatcaaatgg 
cccacaaagc ctgaaatatt tactctttga ccctttacag aaaaaaacct tgttgacccc 



<210> 344 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 344 

ccccaacctg cactctaccc acccccatca cctactccag ctcccaactt ttgtggactg 
agcggccgca gagactgggt cgccttggat tccctctgcc tccgaggacc ccaaaagaca 
cccccaaccc caggccagcc ggccctgctc tggcgcgtcc aaaatactac ctagcacagg 
cctctgctcg aggcaccccc aaactaccta tgtatccagc cccagagggc ctccattccc 
aggaagtccc tatgtatccc aacactggca gacacccagc accaccctcc cagacccgca 



<210> 345 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 345 

cccccatcac ctactccagc tcccaacttt tgtggactga gcggccgcag agactgggtc 
gccttggatt ccctctgcct ccgaggaccc caaaagacac ccccaacccc aggccagccg 
gccctgctct ggcgcgtcca aaatactacc tagcacaggc ctctgctcga ggcaccccca 
aactacctat gtatccagcc ccagagggcc tccattccca ggaagtccct atgtatccca 
acactggcag acacccagca ccaccctccc agacccgcaa gaaagtgaat ctcactacta 



240 
296 



60 
120 
180 
240 
300 



60 
12 0 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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<210> 346 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 346 

gtccacggtg ctgaacatca tcatctttga agactgtagg aaccagtggt ctatgtcccg 60 

accactactt ggcttgatat tgcttaatga aaagtatttt tctgacctaa gaaacagtat 120 

tgtgaacagc cagccaccgg agaagcagca ggccatgcac ctgtgttttg agaacctgat 180 

ggaaggcatc gagcgaaatc ttcttacgaa aaacagagac aggttcaccc agaacctgtc 24 0 

agcattccgt cgagaagtca acgactcaat gaagaattcc acttatggcg tgaatagcaa 3 00 

<210> 347 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<;400> 347 



gctctgagcc caggcgaggc cagggacatg gccatggacc tgtgtcggca ggaccccgag 
tgtgagttct acttcagcct ggacgccgac gctgtcctca ccaacctgca gaccctgcgt 
atcctcattg aggagaacag gaaggtgatc agaccccatg ctgtcccgcc acggcaagct 
gtggtccaac ttctggggcg ccctgagccc cgatgagtac tacgcccgct ccgaggacta 
cgtggagctg gtgcagcgga agcgagtggg tgtgtggaat gtaccataca tctcccaggc 



<210> 348 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<210> 349 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<210> 350 
<211> 270 
<212> DNA 



60 
120 
180 
240 
300 



<400> 348 

gttctgtggc tggcatggtc tgcctgctac tggagagatc tcctgagaat tcaggtttgg 60 

attggtgctg tcatcttcct gggaatgctt gagaaagctg tcttctatgc ggaatttcag 120 

aatatccgat acaaaggaga atctgtccag ggtgctttga tccttgcaga gctgctttca 180 

gcagtgaaac gctcactggc tcgaaccctg gtcatcatag tcagtctggg atatggcatc 240 
gtcaagccac gccttggagt cactcttcat aa&gttgtag tagcaggagc cctctatctt 



300 



60 



<400> 349 

gtcagctttt gatgaagcta tgtcatactg tcgatatcat ccttccaaag ggnattggtg 

gcacttcaaa gatcatgaag agcaagataa agtcagacct aaagccaaaa ggaaagaaga 120 

accaagctct atttttcaga gacaacgtgt ggatgcttta cttttagacc tcagacaaaa 180 

atttccaccc aaatttgtgc agctaaagcc tggagaaaag cctgttccag tggatcaaac 240 

aaagaaagag gcagaaccta taccagaaac tgtaaaacct gaggagaagg agaccacaaa 300 
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<213> Homo sapiens 
<400> 350 

ccatgctgnt aacgggtttc aaggggactc ttgaggaant gccccctaaa atagaacaca 60 

gcaatanggn gggcttcctg tccccaggnc caccccacag tgctntntgg cactggnaac 120 

tctgctangg agngantgna nnnnaccant aannnnnnan nnatcnacan nnnnnnnncn 180 

nnnnnncntn tnnccnannn ntannctncc ntannnnanc cnnccannan cactcncnat 240 

naacgnnnnn ttantgagan nttctcaact 270 

<210> 351 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 351 

aaatgactcc ctgcaaaacc caacccatgc tgctggctgt gggatttttg gtgtaagcct 60 

atctatgcac tctatcagcc agaatttggc atttagctct tagttaaatc tagtaaagga 12 0 

cagtctattg tttaaagaga aggtgcattt gttcctcaat caagcaagag cacctgtgtt 180 

gtactgcttt atatctcatg tatatttata gtaatgaaaa gactttttaa attgtacacg 24 0 

tttcagtgcc tttcttgtgt tatgaaaggc aggtagatat tatagccata ggtaaaaatc 300 

<210> 352 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 352 

aagaaatgcc tctatgtagg tgaagtgttc tctctgcatg caacagtaaa aattaatata 60 
atattttccc cacaaaagaa acacttaaca gaggcaagtg caatttataa atttatatct 120 
aaaggggaat catgattata agtccttcag cccttggact ctaaattgag gggattaaaa 180 
agaatttaaa ataattttga acgaatttat tttcccctca gtttttgagg gcattaaaaa 240 
ggcattaaat caagacaaat catgtgcttg agaaaaataa aattaatgaa aacacagcac 300 

<210> 353 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 353 

cccacactcg gacactgtgg aattctacca gcgcctgtcg accgagacac tcttcttcat 60 

cttctactat ctggagggca ctaaggcaca gtatctggca gccaaggccc taaagaagca 120 

gtcatggcga ttccacacca agtacatgat gtggttccag aggcacgagg agcccaagac 180 

catcactgac gagtttgagc agggcaccta catctacttt gactacgaga agtggggcca 24 0 

gcggaagaag gaaggcttca cctttgagta ccgctacctg gaggaccggg acctccagtg 300 

<210> 354 
<211> 299 
<212> DNA 

<213> Homo sapiens 
<400> 354 
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gaaggaggac ctaggcacac acatatggtg gccacaccca ggagggtagt ggggagttag 60 

atttcagagt ccaggcccta ggttgggacc cactccaaat aatctcctcg gtgtgggtgg 120 

tggttctata gagggataaa tgaataataa acattgttaa aatatacgaa aaaaaaaaaa 180 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 240 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaacnncn ncnananaaa aaaaaaaaa 299 

<210> 355 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 355 

actgttcatc ctaagttcca ctataaacag gctcatgact cgggcacaga cacttcttgc 60 

gtgacttttt cctatgatgg taatgtcctt gcctctcgtg gaggtgacga ttcattaaaa 120 

ttatgggaca tccgacaatt taataaacca cttttttcag cctcgggtct tcccaccatg 180 

ttcccaatga ctgactgctg tttcagtcca gatgataagc tcatagtcac tggtacatct 240 

attcaaagag gatgtggcag cggcaaactt gttttctttg agcgtaggac tttccaaagg 300 

<210> 356 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 356 

ttcagaaaga aacatttaat agggacttac aaacaaatta atgtctgagt ctcaggtggc 60 

agcaagacaa gatggtggat ccccatgcca ttacctgcta gactcagggt ttatatactg 120 

tagtggagag gtgattccga aggaatgttg taagacaatt gaagagcagt aacatcaaag 180 

ttatttgacc taagggcagg agttacagta agtatccact tttatacaag aaacaataga 240 

taaactggaa atcttggagc ccttcctgga actggggtta atgagaagtc aacatggtgg 300 

<210> 357 

<211> 300 

<212> DNA 

<213> Homo sapiens 



<400> 357 

acaaaaccta cagatggaga taaaaattac tactgttatt caacatgtgt tccagaacct 60 

tattttgggg agtaaagtca attgggcaga ggatcctgcc cttaaggaaa ttgttctgca 120 

gcttgagaag aatgttgaca tgatgtaata agaattcatt tctgacatat tttacatttc 180 

tggcaatctc aactcttatt tggaatactt ctgtgcattt gtctgtccac cgtaatttta 24 0 

gaaaagcata tccataacgt ttacagttgt agtacagttg tggttagtta tttgtagtgg 300 



<210> 358 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 358 
ggtgattaca gaagcccaga aggttgatac 
agacacactc aacacattag acggcctcct 
agctggcacc acccagatca ataaactggc 



cagagccaag aacgctgggg ttacaatcca 60 
gcatctgatg gaccctgcac ttgatggacc 120 
ttatttgaat ttgcggcccc ccacccagga 180 
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actgactcag tgcaagaaga cagcttcgac tccctgtgat ttcatctctg accaatccgc 
actcctggct cactggcttc cccaacccat gaagttttcc ttaaaaactc tgctcccgaa 

<210> 359 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 359 

atcaggtgtt cctcccatgg caggagggaa gaaacccagc aaacggccag cctgggactt 
aaagggtcag ttatgtgacc taaatgcaga actaaaacgg tgccgtgaga ggactcaaac 
gttggaccaa gagaaccagc agcttcagga ccagctcaga gatgcccagc agcaggtcaa 
ggccctgggg acagagcgca caacactgga ggggcattta gccaaggtac aggcccaggc 
tgagcagggc caacaggagc tgaagaactt gcgtgcttgn gtcctggagc tggaagagcg 

<210> 360 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 360 

tctgtctggt gatttttatt ttaagtgaac ctttggatct atctttaact ctctttattg 
tgagtggtaa attccaattc tgcagcagat cagtaaactc acagtatttt tcctgtggaa 
atctattcaa taaggaaacc aagacaggat aataaaattt aaaaaaaaac aactttgaat 
tcccctgcct aggtcttcca gttgttttcc agcgcatacc tcaggtatga ctttgctagc 
cggggacaaa attagcacct tccgattctc tagtccaaat gaactttgtg ctaaataaaa 

<210> 361 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 361 

gtagaacaga aaatgagcat ccgatttctt cactaaagga gaccaaactg ttccttgcgg 

tctagtattg aagaactgga acttgaaagt cctccttcta ccaactccac ctccaccccc 

tcattcccct tctcccaaag tactactgct gttgcatgac aaccccaaat atgttctgtc 

aacacaaacc tgcctttggt gtataaacag ggcattacag aatggtacac cctatatatt 

tctgttcagt atccattcac tagttcttca tttataaata tcatcttccc cattctgctg 

<210> 362 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 362 

actaccccgg ctacggttcc cccatgcctg gcagcttggc catgggcccg gtcacgaaca 
aaacgggcct ggacgcctcg cccctggccg cagatacctc ctactaccag ggggtgtact 
cccggcccat tatgaactcc tcttaagaag acgacggctt caggcccggc taactctggc 
accccggatc gaggacaagt gagagagcaa gtgggggtcg agactttggg gagacggtgt 
tgcagagacg caagggagaa gaaatccata acacccccac cccaacaccc ccaagacagc 



240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 



60 
120 
180 
240 
300 
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<210> 363 
<211> 271 
<212> DNA 
<213> Homo sapiens 



<400> 363 

ggcaattagc ctcgcttaag ttgccttttt tacacaccaa aactttttac atgaagggct 60 

ggtttcacat gaatactata ctgaaatctg tgctctcaag atctagcagt gaccagggct 120 

gcccggcggg ggctctcctg gcaagtcagg aaggtnnnnn nnnnnnnnnn nnnnnnnnnn 180 

canattantn nctgatcntc tntnangaan nnngantngc tctnttggnc nttgtnnnnn 24 0 

gncntnnnnt naantntttn ntnatgtngc t 271 



<210> 364 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 364 

agaggaccct gcagttaggg ggtgttactt tgtcgcccag gatggcctgg acccccaggt 60 

tcagggattc tcccgccgct gcttcctgag tagctgggac ctcaggcttc cgcctcgtgc 120 

ccgcatccct gctgtgttta ggcagcaggt ggtgacctca ctcctccctg gcctgagctc 180 

tccgtcccgc atcccaggcg gaggccctag ggaacacttt gaagctgagc acggggtgga 24 0 

ccctccctcc tgagtgaatg gagaatagaa agggagagga tttctgttct gttctgtggg 300 



<210> 365 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 365 

gttcttcaaa gccaaccaag acaggcttag cagttttaga gcttcagaac aaattgccaa 60 

aagccagagt tgtttatgct agtgcaactg gtgcttctga accacgcaac atggcctata 120 

tgaaccgtct tggcatatgg ggtgagggta ctccatttag agaattcagt gattttattc 180 

aagcagtaga acggagagga gttggtgcca tggaaatagt tgctatggat atgaagctta 240 

gaggaatgta cattgctcga caactgagct ttactggagt gaccttcaaa attgaggaag 3 00 



<210> 366 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 366 

gccagtcctc accttcccta gtcctcgtgt gtattttagg agatgcgtgg gtgtggaaca 60 

gcctcctgcc tccggtccag gtgtactggg gtctgtgtgt tgtgtttctg cgtgttctcg 120 

gcagaaagtg gcatgctgtc ccgcctgggt gatttgctct tttacactat tgctgaagga 180 

caggaacgaa tccctatcca caagttcacc actgcactaa aggccactgg actgcagaca 240 

tcagatcctc ggctccgaga ctgcatgagc gagatgcacc gcgtggtcca agagtccagt 300 



<210> 367 
<211> 300 
<212> DNA 
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<213> Homo sapiens 



<400> 367 

cattgccaga gagcggtttc agcaagctgc agatctgatt gatgctgagc aacgaatgaa 
gaagtccatg tggggtcagt tctggtctgc tcaccagagg ttcttcaaat acttatgcat 
agcatccaaa gttaaaaggg ttgtgcaact agctcgagag gaaatcaaga atggaaaatg 
tgttgtaatt ggtctgcagt ctacaggaga agctagaaca ttagaagctt tggaagaggg 
cgggggagaa ttgaatgatt ttgtttcaac tgccaaaggt gtgttgcagt cactcattga 



<400> 369 

gtggggtgtg cctcgtgtgc gtggattcgt gtgtgtgtgt gtgtcttgta tatgtgtgcg 
cagagtgcat cattttcaga ctctactatt tccgtcaagt attctgtttg atttggatca 
tctcaggatc ggattctgtt ttagagtgtt tctgggccag gatccgggcc cctgccctcc 
tctgcacctg accacactcc ctactcaggg ctagtctgtt cttcccggac atcttctggt 
agccgtgcag gagagggctg ggtggggcag aggccacaga ggggacctgg tgtgtcacct 



60 
120 
180 
240 
300 



<210> 368 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 368 

gcccggcccg gcgacgctgg cgacgctttc gcccctgagg tagtttggcg accgcgaaga 
aggaaaaagg gcgggcgggc ggctgtcctc tcaccgtcct caccccgcga ggcccggccc 
gctcctccgt cgtggatttc gcggcgatcc ccccggcagc tctttgcaaa gctgcttgaa 
acttctccca aactcggcat ggatacgact gcggcggcgg cgctgcctgc ttttgtggcg 
ctcttgctcc tctctccttg gcctctcctg ggatcggccc aaggccagtt ctccgcaggt 300 

<210> 369 
<211> 300 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 



60 
120 
180 
240 
300 



<210> 370 
<211> 273 
<212> DNA 
<213> Homo sapiens 

<400> 370 

cagaggctgg ttcagaaaag gaggaagagg cccggctggc agccctggaa gagcagagga 
tggaggggaa gaagcccagg gtgatggcag gcaccttgaa gctggaggat aagcagcggc 
tggcccagga tgaggagagt gaggcctagc gcctggccat tatgatgatg aagaagctnn 
nnnnnnnnnn nnnnnnnnnc atcatgtccn ntgcatggct acctatccca tatttnatnt 
ccctnncgtt gnttcnaatt ncacattntc ttt 

<210> 371 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 371 



60 
120 
180. 
240 
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<210> 372 
<211> 300 
<212> DNA 

<213> Homo sapiens 



<210> 373 
<211> 300 
<212> DNA 
<213> Homo sapiens 



<210> 374 
<211> 300 
<212> DNA 
<213> Homo sapiens 



60 



gatgaggagt gtttaatcat tgatacagaa tgtaaaaata atagtgatgg aaagacagct 
gttgtgggtt ctaacttaag ttccagacca gctagtccaa attcttcctc aggacaggct 120 
tctgtaggaa accagactaa tactgcttgt agtcctgaag agtcatgtgt tttaaaaaaa 180 
cctatcaaac gagtatataa aaaatttgat ccagttggag agattttaaa aatgcaggat 240 
gagctcttaa agccaatttc cagaaaagta ccagaattgc ccttaatgaa tttagaaaat 



300 



<400> 372 

gggccccaat gcagctgccc tctccagata cctggcagcc tcatatatca gccaaagcct 60 

ggctcggcgg caggggcctg ggggaggggc ccccgcagcc tcccggggct cctggtcctc 120 

tgctcccacg tcacgggcat cttcgccgcc cccccagccc cagccaccac ctcccgcagc 180 

caggcggctc agctatgcca cgacggttaa catccacgtg ggcgggggtg ggcggctgcg 240 

gccagccaag gcccaggtcc ggttgaacca ccctgctctc ttggcctcca cacagqaatc 300 



<400> 373 

accctttctg ccttctgttt gggacccagc tggtgttctt tggtttgctt tcttcaggct 60 
ctagggctgt gctatccaat acagtaacca catgcggctg tttaaagtta agccaattaa 120 
aatcacataa gattaaaaat tccttcctca gttgcactaa ccacgtttct agaggcgtca 180 
ctgtatgtag ttcatggcta ctgtactgac agcgagagca tgtccatctg ttggacagca 
ctattctaga gaactaaact ggcttaacga gtcacagcct cagctgtgct gggacgaccc 



<210> 375 
<211> 300 
<212> DNA 
<213> Homo sapiens 



240 
300 



<400> 374 

tcaaggccta cgaacaggtg atgcactacc ccggctacgg ttcccccatg cctggcagct 60 

tggccatggg cccggtcacg aacaaaacgg gcctggacgc ctcgcccctg gccgcagata 120 

cctcctacta ccagggggtg tactcccggc ccattatgaa ctcctcttaa gaagacgacg 180 

gcttcaggcc cggctaactc tggcaccccg gatcgaggac aagtgagaga gcaagtgggg 240 

gtcgagactt tggggagacg gtgttgcaga gacgcaaggg agaagaaatc cataacaccc 300 



<400> 375 

cttcagtgca cacaacagga gagaggagaa agaagaaacg ctagtaattc caagcactgg 60 
aattaagttg ccttcatcag tgtttgcttc agagtttgag gaagatgttg tgattgttaa 120 
ataaagcagc tccagtttca ggacctcgac tggattttga tcctgacatt gttgcagctc 



180 
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ttgatgatga ttttgacttt gatgatccag ataatctgct tgaggatgac tttattcttc 240 
aggccaataa ggcaacagga gaggaagagg gaatggatat acagaaatct gagaatgaag 300 

<210> 376 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 376 

gggagactgg ggtctatttc acccctgcag tctcgaccat aagagatggc tacacccagg 60 
ggggccagtt cagagaccca ctcccaggtg tgcattctct ttctcaagga tgttccttgc 12 0 

tgagaaaaag aattcagtga tatttctccc atttgcttgt gaaagaagag aaatgtggct 180 
ttgttccacc tggctcaccg gcggtcagaa tttaaggtta tctctcttgt ttcctaaaca 240 
ttgctgttat cctgttcttt tttcaaggtg cccagatttc atattgctca aacacacatg 300 

<210> 377 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 377 

gatcagccca cctcggcctc acaaagtgct gggattacag gcgtgagcca ccttgcccag 60 

cccacatcat acagtttgaa atgaaacttt gccacaacca gcctttgctg tagcacacac 120 

atatatcact gaacctgttt gaaataaagt tttttttctt tttcctctgg tattctgggt 180 

tctgaagtct ggtattctgg tattctgggt tcaaaagtat gacttgagag tgttgctctg 240 

gtattctgag agttgctctg tattctgggt tctgaagatt atttgaaaaa taactcctac 300 

<210> 378 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 378 

tcgctgtgat ccaaggataa aaaagttcaa ggaagaagaa aaagccaaga aagaagcaga 60 

aaagaaagca aaagcagaag ctaaacggaa ggagcaagaa gctaaagaaa aacaaagaca 120 

agctgaatta gaagctgctc ggttagctaa ggagaaagaa gaggaggaag tcagacagca 180 

agcattgctg gcaaagaagg aaaaagatat ccagaaaaaa gccattaaga aggaaaggca 240 

aaaacttcga aactcatgca agacctggaa tcatttttct gataatgagg cagagcgggt 300 

<210> 379 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 379 

acactataga atacaagcta cttgttcttt ttgcaggatc ccatcgattc gaattcggca 60 

cgaggcagct tcgagccaat ggtgagctcc ttctggatca gctccttcag ctccttcttg 120 

ctcaggatgc tgaaattgca aggctgatgg aagacttgga ccggaacaag gaccaggagg 180 

tgaacttcca ggagtatgtc accttcctgg gggccttggc tttgatctac aatgaagccc 240 

tcaagggctg aaaataaata gggaagatgg agacaccctc tgggggtcct ctctgagtca 300 
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<210> 380 
<211> 296 
<212> DNA 

<213> Homo sapiens 
<400> 380 

acctggacag ggccagctgc tgggggagcg gcactgggga ctggaggctg gaagcgggtg 60 
gtgtgtgtcc cctgtttact tttagctgag ctggggttgg gtgtacgggt tctgttcctc 120 
tgagccctgc ggcccacctg atgtttacgt gtgtgtgtga gggggggcnn nnnnnnnnnn 180 
nnnnnnnnnn ngtnatangc ttaacanatg nanagncnac tnactnctga ttntttatnc 240 
atttgtgcat tnnaactatg cttttncgat cttnctgntg nnatnacngg catgat 296 

<210> 381 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 381 

cagaaaagag tatagtaggg atgaccaagg tcaaagtggg taaagaagac tcatcatcca 60 

ctgagtttgt agaaaaacgg agagcagctc ttgaaaggta tcttcaaaga acagtaaaac 120 

atccaacttt actacaggat cctgatttaa ggcagttctt ggaaagttca gagctgccta 180 

gagcagttaa tacacaggct ctgagtggag caggaatatt gaggatggtg aacaaggctg 240 

ccgacgctgt caacaaaatg acaatcaaga tgaatgaatc ggatgcatgg tttgaagaaa 300 

<210> 382 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 382 

gccaccggtc tcttcctaat ctgcacagac tattttgggt atttctgggc gggcagttcc 60 

tttgcatgtt tcgggagagg tttgctgatt tggggcttat atgtcaggcc tttggtttgc 120 

gtcttatttt aggggttgtt tgggggcctg ggtggtcggc ctcacatggg aaggggatgg 180 

gtagtggatg gggtttctgt tgtatcttgt gggcgggtga ttttgctttt gtttttgttt 24 0 

cacattcttc cccctccaca agccaaagtc gtttcatttg gtttccactg tgtggactgt 300 



<210> 383 
<211> 273 
<212> DNA 

<213> Homo sapiens 
<400> 383 

gagatttgat attcgagtgc tgggcttagg tctgttgata aatctagtgg agtatagtgc 60 
tcggaatcgg cactgtcttg tcaacatgga aacatcgtgc tcttttgatt cttccatctg 120 
tagtggagaa ggggatgata gtttaaggat aggtggnnnn nnnnnnnngc cngcnttnac 180 
ttnatngcnn ctttttcttg atcnacgncn gnnatncnna nnngtntata ntaatncnga 240 
ananttnttt gnnntgcttt atcaantntt cnt 273 

<210> 384 
<211> 259 
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<212> DNA 

<213> Homo sapiens 
<400> 384 

aagagaagga cctagagatt gagaggctta agacgaagca aaaagaactg gaggccaaga 60 
tgttggccca gaaggctgag gaaaaggaga accattgtcc cacaatgctc cggccccttt 120 
cacatcgcac agtcacaggg gcaaagcccc tgaaaaaggc tgtggtgatg cccctacagc 180 
taattcagga gcaggcagca tccccaaatg ccgagatcca catcctgaag aataaaggcc 240 
cgaagagaaa gctggagtc 259 

<210> 385 
<211> 2 96 
<212> DNA 
<213> Homo sapiens 

<400> 385 

agagcctgca agtgacaaag gaagtgaggc agaggcccac atgcccccac cgttcacacc 60 

ctacgtgcct cggattctga acggcttggc ctcggagagg acagcactgt ctccgcagca 120 

gcagcagcag cagacctatg gtgccatcca caacatcagc gggactatcc ctggacagtg 180 

cttggcgcat agcgccacgg gcagtgtggc ttgctgcccc ccaggaggcc tgaggctggg 24 0 

tctcactgct ctgaaaaaga cccnnccaaa atgggccttg gggctnnagg cccttg 296 

<210> 386 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 386 

gaagaggagg ctgtgtatga ggaacctcca gagcaggaga ccttctacga gcagccccca 60 

ctggtgcagc agcaaggtgc tggctctgag cacattgacc accacattca gggccagggg 120 

ctcagtgggc aagggctctg tgcccgtgcc ctgtacgact accaggcagc cgacgacaca 180 

gagatctcct ttgaccccga gaacctcatc acgggcatcg aggtgatcga cgaaggctgg 240 

tggcgtggct atgggccgga tggccatttt ggcatgttcc ctgccaacta cgtggagctc 300 

<210> 387 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 387 

ccgcagaggg cctggaagag gtgct caeca cgccagagac tgtgctcaca ggccacacgg 60 
agaagatctg ctccctgcgc ttccacccac tggcagccaa tgtgctggcc tcgtcctcct 120 
atgacctcac tgttcgcatc tgggaccttc aggctggagc tgatcggctg aagctgeagg 180 
gccaccaaga ccagatcttc agcctggcct ggagtcctga tgggcagcag ctggccactg 240 
tetgeaagga tgggcgtgtg egggtctaca ggccccggag tggecctgag cccctgcagg 300 

<210> 388 
<211> 300 
<212> DNA 
<213> Homo sapiens 
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<400> 388 



tggaggtctc ctttcgcccc 
gagcacattt tgtagggtgg 
aagagaaaaa agtgaccttt 
tactgatttt ttttttctta 
tttcatactg tggggcaaag 



agcccaggtg gccaagccca tcctggcctc agaacatgct 
caccttttta tccaagttac tagctacaca tcagtgttta 
catttttttt tcttgaaact tgaggaaaca agatacatac 
aaactaaatg catgactgca gagcggtaga ggtgtatatt 
tatttgtgct gctttttgga gatggactgg aacgtctggt 



60 
120 
180 
240 
300 



<210> 389 
<211> 293 
<212> DNA 

<213> Homo sapiens 
<400> 389 

gtcaagctgg ccctggatgt ggagatcgcc acctaccgca agctgctgga gggcgaggag 60 
tgcaggctga atggcgaagg cgttggacaa gtcaacatct ctgtagtgca gtccaccgtc 12 0 

tccagtggct atggcggtgc cagcggtgtc ggcannnnnn nnnnnniiiinn nnnatgaanc 180 
agntactcct atggnnttag cnttntanct atnacctgcn cnaactannc tnangtgcta 240 
grmcttgccc caacccctac ttttgtattt atattgtgtg tgcgtgtgtg cgt 293 

<210> 390 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 390 

ctcacacctg ctttggatgc ttcaagcacc tcagccctct gaactacaaa acagangagc 60 

ctgcaagtga caaaggaagt gaggcagagg cccacatgcc cccaccgttc acaccctacg 120 

tgcctcggat tctgaacggc ttggcctcgg agaggacagc actgtctccg cagcagcagc 180 

agcagcagac ctatggtgcc atccacaaca tcagcgggac tatccctgga cagtgcttgg 240 

cgcagagcgc cacgggcagt gtggctgctg ccccccagga ggcctgaggc tgggtctcac 300 

<210> 391 
<211> 257 
<212> DNA 
<213> Homo sapiens 

<400> 391 

acccgtccgg ggccggccaa tttgcatatt tggaatgcgc cgctataaac ccggctgggg 60 

ttttgcagcg atttcttaga tgtaaaaatg agatctcaat agcagcgggc tgggcacatc 120 

ctctcctctc tccttctctc tctgcccgga gctggtttcc gtctctcggc tcggggctgg 18 0 

aactccggcc caacctaggc gcgcagccgc cacgagatgg cgcacttccg atcaatgtca 24 0 

aagccgccgg ggagccc 257 

<210> 392 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 392 

gcgcgagcgt cggctccgcc tgggcccttg cggtgcgctg cgggcaggcg gtgaggctca 60 
cgcatgtgct tacgggcaag aacctgcaca cgcaccactt cccgtcgccg ctgtccaaca 120 
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accaggaggt gagtgccttt ggggaagacg gcgagggcga cgacctggac ctatggacag 180 
tgcgctgctc tggacagcac tgggagcgtg aggctgctgt gcgcttccag catgtgggca 240 
cctctgtgtt cctgtcagtc acgggtgagc agtatggaag ccccatccgt gggcagcatg 300 

<210> 393 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 393 

gcgcgagcgt cggctccgcc tgggcccttg cggtgcgctg cgggcaggcg gtgaggctca 60 

cgcatgtgct tacgggcaag aacctgcaca cgcaccactt cccgtcgccg ctgtccaaca 120 

accaggaggt gagtgccttt ggggaagacg gcgagggcga cgacctggac ctatggacag 180 

tgcgctgctc tggacagcac tgggagcgtg aggctgctgt gcgcttccag catgtgggca 24 0 

cctctgtgtt cctgtcagtc acgggtgagc agtatggaag ccccatccgt gggcagcatg 300 

<210> 394 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 394 

gtccatacat ggagctccct ggagcccgtg tgctctcgtg tgactgaacg ttttgtgatg 60 

aaaggaggag aggctgtctg cctttatgag gagccagtgt ctgaattgct gaggagatgt 120 

gggaattgca cacgggaaag ctgtgtggtt tccttttacc tttcagctga ccatgaactc 180 

ctgagcccga ccaactacca cttcctgtcc tcaccgaagg aggccgtggg gctctgcaag 240 

gcgcagatca ctgccatcat ctctcacnag gngaccatat tggtttttga cctggagacc 300 

<210> 395 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 395 

gcaaaatcaa tgtggactga acataaatca cctgatggaa 
gaaaccaaac agtctacctg ggagaaacca gatgatctta 
ttatctaaat gcccctggaa ggaatacaaa tcagattctg 
tctcaaacaa aagaatctcg ctgggccaaa cctaaagaac 
atcaaagctg aagaaagcag taagcaagaa gagtgcacca 



<210> 396 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 396 

aagagcacaa gaggaagaga gagaccctca ctgctgggga gtccctgcca cactcagtcc 60 

cccaccacac tgaatctccc ctcctcacag ttgccatgta gaccccttga agaggggagg 120 

ggcctaggga gccgcacctt gtcatgtacc atcaataaag taccctgtgc tcaaccaaaa 180 

aaaaaaaaaa aaaaacnnnn nnnnnnnnnn nntntngggn gnctnntnnc nnaaanccan 24 0 
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ggacttacta 


ctacaacact 


60 


aaacacctgc 


tgagcaactc 


120 


gaaagcctta 


ctattataat 


180 


ttgaggatct 


tgaagcaatg 


240 


caacatcaac 


agccccagtc 
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ncttnataaa anccttngnt natttggaac aacccncann taaanngcag ggaaaaaaag 300 

<210> 397 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 397 

gataaatacc tcagcccctc gccttcctca acccacctgg caagtcttct taggatctga 60 

tcccagtttt ctggaagcaa tcctacccca gcccaagctt cccagagtcg agccttaatc 120 

cttctcactt ctcagtgtca gagcagaaat gaatcctggg gttgactgtg tccattcggg 180 

ttattagcag ctaagaagcc cagacgagta gtgtgagctg ccttgggagc ctcagtgagg 240 

gcactgggac tggcctcact ctcttgcccc cagcctagtg ggctttctcc tctgtctctc 300 

<210> 398 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 398 

ctgaacccta aaggaaagcc agcaaaccag ctgcttgctc tcaggacttt ttgcaattgt 60 

tttgttggcc aggcaggaca aaaactcatg atgtcccaga gggaatcact gatgtcccat 120 

gcaatagaac tgaaatcagg gagcaataag aacattcaca ttgctctggc tacattggcc 180 

ctgaactatt ctgtttgttt tcataaagac cataacattg aagggaaagc ccaatgtttg 240 

tcactaatta gcacaatctt ggaagtagta caagacctag aagccacttt tagacttctt 300 

<210> 399 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 399 

gctgacctac agcagaagct gctggatgca gaaagtgaag acagaccaaa acaacgctgg 60 

gagaatattg ccaccattct ggaagccaag tgtgccctga aatatttgat tggagagctg 120 

gtctcctcca aaatacaggt cagcaaactt gaaagcagcc tgaaacagag caagaccagc 180 

tgtgctgaca tgcataagat gctgtttgag gaacgaaatc attttgccga gatagagaca 240 

gagttacaag ctgagctggt cacaatggag caacagcacc aagagaaggt gctgtacctt 300 

<210> 400 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<400> 400 

ggctagcgat ttctacctgc gctactacgt agggcacaag ggcaagtttg ggcaccgagt 60 

ttctggagtt cgaatttcgg ccggacggaa agcttagata tgccaacaac agcaattaca 120 

aaaatgatgt gatgatcaga aaagaggctt atgtgcacaa gagtgtaatg gaagaactga 180 

agagaattat tgatgacagt gaaattacaa aagaagatga tgctttgtgg cctccccctg 240 

atagggttgg ccgacaggag cttgaaattg taattggaga tgagcacata tcttttacca 300 

<210> 401 
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<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 401 

acccccttca tggacagatc ccccacagcc tggggcagaa gaggcgtcga gggcgccaga 60 
agtggcggca gcagcagccg cagcagccaa agagaggcaa gagaaagaga aagcgggcgg 120 
tggaggggtc ccggaagagc tggtccccgt ggttgagctg gtccccgtgg ttgaattgga 180 
agaggccata gccccaggct cagaggccca gggcgctggg tctggtgggg acgcgggggt 240 
gcccccaatg gtgcagctgc agcagtcacc actagggggt gatggagagg aagggggcca 300 

<210> 402 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 402 

ggatcctttc cagacagaag accccttcaa atctgaccca tttaaaggag ctgacccctt 
caaaggcgac ccgttccaga atgacccctt tgcagaacag cagacaactt caacagatcc 120 
atttggaggg gaccctttca aagaaagtga cccattccgt ggctctgcca ctgacgactt 180 
cttcaagaaa cagacaaaga atgacccatt tacctcggat ccattcacga aaaacccttc 240 
cttaccttcg aagctcgacc cctttgaatc cagtgatccc ttttcatcct ccagtgtctc 

<210> 403 
<211> 300 
<212> DNA 

<213> Homo sapiens 



60 



300 



60 
120 
180 
240 



<400> 403 

aattccgttg ctgtcggaca gattgcccta gtacccaccc acctatcagg gttatgcaat 
ggaacatcct cgcccaagct cttggagaag gcaaagacaa ctttgtacag tgccctgttg 
aagcactcaa atgggaagaa aggaaatgtc tcatcctgga agaaatcctg gcctaccagc 
ctgatatatt gtgcctccaa gaggtggacc actattttga caccttccag ccactcctca 
gtagactagg ctatcaaggc acgtttttcc ccaaaccctg gtcaccttgt ctagatgtag 300 

<210> 404 
<211> 300 
<212> DNA 

<213> Homo sapiens 
<400> 404 

agtgggataa aatgagacga gccctggaat ataccattta caatcaggaa cttaacgaga 60 
ctcgtgccaa acttgatgag ctttctgcta agcgagagac tagtggagaa aaatccagac 120 
aattaagaga tgctcagcag gatgcaagag ataaaatgga ggatatcgaa cgccaagtta 180 
gagaattgaa aacaaaaatt tcagctatga aagaagaaaa agaacagctt agtgctgaaa 
gataagagca gattaagcag aggactaagt tggagcttaa agccaaggat ttacaagatg 

<210> 405 
<211> 856 
<212> DNA 

<213> Homo sapiens 
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<400> 405 

tggtgccngt tcctattccg tgctntcgtn ctncnccagg ancnangcgt ntcgaattcg 60 

gcacgaggaa ggaggaccta ggcacacaca tatggtggcc acacccagga gggtagtggg 12 0 

gagttagatt tcagagtcca ggccctaggt tgggacccac tccaaataat ctcctcggtg 180 

tgggtggtgg ttctatagag ggataaatga ataataaaca ttgttaaaat atacgaaaaa 240 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaan aaaaaaaana aaaaaaaaaa anaanaaaaa 300 

aaaananaaa aatnaaaaan annanaaaaa aaaaaaaaaa aanncccctn cnccctaaaa 360 

nattcngggg ggntttttcc tccannccnn ntntttaata nnctncttnt tgnntcttng 420 

nctcaccnnt tcttttggtn ggcnntaana naaaatnttn nttttttttn ggntanaaat 480 

ncnntnncng ttttttntnn ttttttttcn aaaccctcct nttntanctc ncgtntcnaa 540 

aaanntnttt ntccncnncn nttnnntnnt nctntttcta tttttnnttc ttntncaann 600 

ttccnangtg nnnngngtnt nntgnggctt gtttnttttt ncnncctngc gtcatccnnc 660 

caataatttc ttnncncccc nannccnnat tttttntnnc ctctatntnn gnngngrmat 720 

atnantcccc tttattnttn atnantagtc ntntnttttn ttntccntng tnatannatt 780 

ttntntcccn ntntaanttc ctcannnnat ttnntnnncn ncgngntata tttnangnta 84 0 

nntcnncggg gttnct 856 

<210> 406 
<211> 843 
<212> DNA 
<213> Homo sapiens 

<400> 406 

tnntnnnnnc gnangctggn nnnttctncc cntttctaat ngttnctaat actanggatn 60 

gtcacgaggn tcccangtag gcatagcgca ctgctgtacg tttttggttg tttttaagaa 120 

actcgatgaa gaggggtgtc attctgggct cggggtggtt gccaattttt caccagaaag 180 

ggagccaccc cttgcaacca cttctgtctc cgttagcccc ccctctgccc tcctccaagc 240 

caaagcgtgg cctggctttt gtcttcccat ttagttttcc tcttttaccc ttccttttgt 300 

gcttaattta ttaaaatagt tgctgtataa tttattttca taaactataa aaaaatacta 360 

aatggttaaa atagacttgc aggccaatct taaatggggt gggaggggtc tgagggtggg 420 

atggggaaag ggaaagaggt tttgatntaa acaaaacaaa tgcactttgg gtgtgtnnng 480 

gnattttnct ggggatanan ggggtggggg nnnnngnann nnnnnnnnnn nnnnnnnnnn 540 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 84 0 

843 

<210> 407 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 407 

tgggcnggnn ctantngngg gctctcgact tcntacganc ttccaatggt tnggnggcac 60 

gagcccccat cttcactggt tattccactt atttaaaatg tccagaataa gcaaatctcc 120 

atatagagga agtagattag tggttgcttc gggatgggag gaatgggaag attgaggtct 180 

ttcttttgca gtgataaaaa tgtcctaaaa ttgactgtag cgatggtcac acaactctga 240 

atatgcttaa gaccattgaa ttacacactt tacgttggtg aattgtatgg tatgtaaatt 300 

atagttcaat aacatagtta caaaagataa tcaaaagcat gaaagcactg ttgatgtggt 360 
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ttggatctgt gtcctcaccg agtctcatgt tgaaatgtaa gccccctggt gggaggcgat 
gggattatgg ggcagagtcc tcacaaacgg tttacaccac ccgctcagtg ctggtctcct 
gatattgagt cctcatcaca tctggttgct tcaaagtgtg tggtgcctcc cctctatctc 
cctnctgctc tggccatata agatgtgcct gcttctcttc gccttctaac atgattgnaa 
gtttcctgag gcctncctag aacaaaactg ctgtgctttc tgnncccatc tacaggaccc 
ggagccaatt naaccccttt tctttataaa aaaaaannnn rinnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnt 



420 
480 
540 
600 
660 
720 
743 



60 



480 
540 
600 
660 
720 
746 



<210> 408 
<211> 746 
<212> DNA 

<213> Homo sapiens 
<400> 408 

tgtccgnttc ttangntgng ctctngcttc ctaggagtnt cnaatcgctt ggtgcagacc 
tccagcacaa gcctctgcta gtngatctca cggtagaaga aggtcaaaga ttaaaggtna 120 
tttttggttc acacactggt ttccatgtaa tngatgttga ttcaggaaac tcttatgata 180 
tctacatacc atctcatatt cagggcaata tcactcctca tgctattgtc atcttgccta 240 
aaacagatgg aatggaaatg cttgtttgct atgaggatga gggggtgtat gtaaacacct 300 
atggccggat aactaaggat gtggtgctcc aatggggaga aatgcccacg tctgtggcct 360 
acattcattc caatcagata atggctgggg cgagaaagct attgagatcc ggcagtggaa 420 
caggacattt ggatggagta ttnatgcata agcgagctca aaggttaaag tttctatgtg 
aaagaaatga taaggnattt tttgcattcc gtgcgatctg gaggaagtag cccaagtgtt 
tttcatgacc ctcaacagaa attccatgat gaacctggta accagaagaa ccccttggca 
cttatcttca tggcgttatt ctaatttaaa aagaacataa ctcatgngga cttatgccca 
gtctagaggc agaatcagaa ggcttgggtg gaacatatcg ntttcctttt tcctttcctt 
cggccctncc agnccagtcc atnttt 

<210> 409 
<211> 761 
<212> DNA 
<213> Homo sapiens 

<400> 409 

ggatccggtt tccaatgctc gggcnctcga gctncctaag annttgctaa tgcttggngg 
ngtgcctcgt gtgcgtggat tcgtgtgtgt gtgtgtgtct ngtatatgtg tgcgcagagt 
gcatcatttt cagactctac tatttccgtc aagtttctgt ttgatttgga tcatctcagg 
atcggattct gttttagagt gtttctgggc caggatccgg gcccctgccc tcctctgcac 
ctgaccacac tccctactca gggctagtct gttcttcccg gacatcttct ggtagccgtg 
caggagaggg ctgggtgggg cagagccagg aggggacctg gtgtgtcacc tgcccaccac 
ctggctcatc cctcangccc accctgaccc tacattacat aggttacgtc agcctactgt 
ggctgttgag caaagcattt ctcctttctn gggcctcatt gcactagatg ggcctgtggt 
cccaaagtag gtcagtaggt tggggttgct gacacccctt gggtgcaact ttgggacaag 
atgaantggc tctgtcctgt cactggcctc tccttgcctg ggggctatgt gcacttcaaa 
accctggcca agctcaagcc catgaagnat tggagaacac cctgggcccc caagaactgg 
angcacccgg ccanttcccc tgggattcca nctttgccan ggtgaaccct tcttttaccc 
naaacttntg tccccctgnt tccacttcca aaaanaactg g 

<210> 410 
<211> 748 
<212> DNA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
761 
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<213> Homo sapiens 
<400> 410 

gatgccggtt cctatgatgn gctctcggct tcctaggagt tccaanactn ggctngcncg 60 

aggncttnta aatatatctn ggntttanta ggtgataagt nctgtcantt agtancatct X2 0 

gaaaaancag ctttgtcctg ggtgaaaaag gatgccaaaa ttgcctggaa aagagcagtg 180 

anaggagtcc gggagatgtg tgatgcntgt gaagcancat tgtttancat tcactgggtc 240 

tgccaaaaat gtggatttgt ggtctgctta gattgttnca aggcaaagga aaggaagagt 300 

tctagagata aagaactata tgcttggatg aagtgtgtga agggacagcc tcatgatcac 360 

aaacntttaa tgccaaccca aattatacct ggttctgttt tgacagatct tctagatgcc 420 

atgcacactc ttagggaaaa atatggtatt aaatcccatt gncattgtct aacaaacaga 480 

atttacaagt tggaaatttt cctncatgaa tggtgtatct caagtttaca gaatgtctta 540 

atcacagtat aaaattctct gngcatgcct gagtctcagc gccaaaatcc tcctccgaag 600 

tctgagaaaa atggtggcag cnnccccana aagtgatgtt nggcnccaga ttaccaggtt 660 

aacttcctcc agaatnccag tcaccactgn actggntagc anatcttgcc gagccaaaaa 720 
gcccnaagng ggaaaaaaaa aaaaaaaa 



748 



<211> 773 
<212> DNA 
<213> Homo sapiens 

<400> 411 

gnangnnngn ttcntagtgc ccgtgggagt cttagatncc ctaaaaaatt gntaatgctn 60 

ggtcggcacg agtcaaggcc tacgaaacag gtgatgcact accccggota cggttccccc 12 0 

atgcctggca gctnggccat gggcccggtc acgaacaaaa cgggcctgga cgcctcgccc 18 0 

ntggccgcag atacctccta ctacoagggg gtgtactccc ggcccattat gaactcctct 240 

taagaagacg acggcttcag gcccggctaa ctttggcacc coggatcgag gacaagtgag 300 

agagcaagtg ggggtcgaga ctttggggag acggtgttgc agagacgcaa gggagaagaa 360 

atccataaca cccccacccc aacaccccca agacagcaat cttcttcacc cgcttgcaac 420 

ccgttccgtc ccaaacagag ggccacacag ataccccacg ttctatataa ggaggaaacc 480 

gggaaaagaa tataaagtta aaaaaaaagc ctccggtttc cactactgng tagacttcct 540 

gcttcttcaa cacctgcaga ttctgatttt tttgttgttg gttgttctct ccattgctgn 600 

tggtgcangg aagtcttact taaaaaaaaa aaaattttgn gagtgactcg gtgtaaaacc 660 

atgttanttt taacagaacc nanaagggtt gncctattgg ttaaaaaaaa aaaaaaaaaa 720 

aaacttngng cctttagaac tattanngag nccnatttac nttaatccan net 773 

<210> 412 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 412 

gnanncegga ttcntagcgn tcgtggaagt geateggctg ntaacaattt getaatgett 60 

ggagttccaa ttccagagaa aatgagtgaa tgggcacctc gacctccccc agaaatttgt 120 

ccgagatgtc atgggttcaa gtgctggggc cggcagtgga gagttccacg tgtacagaca 180 

tctgcgccgg agagaatatc agegacagga ctacatggat gccatggctg agaagcaaaa 240 

attggatgea gagtttcaga aaagactgga aaagaataaa attgetgeag aggagcagac 300 

cgcaaagcgc eggaagaage gecagaagtt aaaagagaag aaattactgg caaagaagat 360 

gaaacttgaa cagaagaaac aagaaggacc cggtcagccc aaggagcagg ggtccagcag 420 

etctgeggag gcatctggaa cagaggagga ggaggaagtg cccagtttca ccatggggcg 480 
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540 

600 



atgacaatgt ttgccacagc ctctgcctgg aacctggctc gtgctgtgac cagaagggaa 
aggcggctgt ttggctcttt cttccccgca aggaccccgc ttacccgctg gatggagagc 
aaaggagacc cccttccgag cccgntcaca gtcctgtatt tggcaaggtt tgggaacctg 660 
aaggggccaa tntnccttga cacttanang cacttgcctt tcagacacca ttccgngcnt 720 
ctggtaaaag gggacaagaa aagccttaac cttggcnnca tattttgaca gggg 



774 



<210> 413 

<211> 773 

<212> DNA 

<213> Homo sapiens 

<400> 413 

gnngnnnnnn tttctaatgc ttgggnnnnn ngtcnatgcn taagagccan gcggntcgaa 60 
ttcggcacga ggcgggcccg gccagcggaa gcccctgcgc ccgcgccatg tcaaagaaaa ^ ™ 

aaaggactga gtgcagaaga aaagagaact cgcntgatgg aaatattttc tgaaacaaaa 
gatgtatttc anttaaaaga cttggagaag attgctccca aagagaaagg ctttactgct 
atgtcagtaa aagaagtcct tcaaagctta gttgatgatg gtatggttga ctgtgagagg 
atcggaactt ctaattatta ttgggctttt ccaagtaaag ctcttcatgc aaggaaacat 360 
aagttggagg ttctggaatc tcagttgtct gagggaagtc aaaagcatgc aagcctacag 420 
aaaagcattg agaaagctaa aattggccga tgttgaaacg gaagagcgac caggcttagc 480 
aaaagacttt cttcacttcg agaccaaang ggaacagcta aaggcagaag tagaaaaaat 540 
ncaaagactg tgatcccgca agttgtngga agaaatcgcc aagcaaatna agtagcccaa 600 
ggaactgctt acagatggac tgattacata ttcgcaataa aatcttnggc ccaaagaaaa 660 
atttngggtt tgaaggaaaa ttaaattggt tngaaccttt tggaatttcc cgaaagactt 
ttgcctncnt ngacttaaaa tatttccatg gnggtgaaag gttgtccaan ctt 



120 
180 
240 
300 



720 
773 



<210> 414 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 414 

gnagnnnnnn nttctaatgc ttggggnnnn nngtcaatnc ctnngancna ggcggntcgc 60 

tcatccagaa angtcagatc ancaaagaag tccangaaaa antgcgaccc agctngaccn 120 

tttgatccca ggcttagcac acgattgcat ggcntcccct ttagccactt naaccactgc 180 

agacntccag gaagctggac t etc tec tea gtccntccag acttctggcc accacagant 240 

gaaaacccca ttttcaactg agctatcttt gctccagcct gatactccag actgtgctgg 300 

agatagtcat accccactgg ctttttcctt caccgaggac ttggaaagtt cttgtttgct 360 

agaccgaaag gaagaaaaag gggattctgc caggaaatgg gaatggcttc atgagtctaa 420 

gaagactatc agagtatgga gaaacacacc aaactacctg gggacaaatg ctgtcagccc 480 

ttaggcaaga ctaaattgga aagaaaggtg tetgecaaag aaaacaggca ggcccctgtc 54 0 

ctccttcaaa catacaggga atcctggaat ggagaaaaca tagaatcagt gaaacaaacc 600 

cgtagtccag ttctgngttt tcctgggata tgaaaagaat gaccanggac tnctggagtc 660 

aacttttcac ttgaagaatc tcaaggccac eggtcattgg ccacacactn gaactccttt 720 
ttaagatgta cccattactg gaattgggct taggg 



755 



<210> 415 
<211> 852 
<212> DNA 
<213> Homo sapiens 

100 



BNSDOCID: <WO 9933982A2J_> 



WO 99/33982 <w PCT/US98/27610 



<400> 415 

gnagnaannn ttctaatgct tgggnnnnnn ngtcaaacct tannaacctg gcntgncgaa 60 

ttcggcacga ggtcacaggc aggtttantg gccagtttaa aacttatgct ntctgcgggg 120 

ccnttcgtag gatggtgagt gtttccctgg gctttgctca tcacttcggg acatcgtgga 180 

ctttaccgtg cgcnttggag tgtgtgatgg tgcctgagta gatctgctgg cagagtagtt 240 

tgagccagct ggactgggct ggccgcctgc cgcttcttga gggtggaaga ggggtgctct 300 

gagaagacac tcaggcagca gactctgcct ctcactagga ggtgcccccc cgaccccgtt 360 

ccaccatagt caaggctgca ggctgccccg ggagaagtgg ctccccttct tgcgcctgtc 420 

ttccattcgc ttcaccgggg gganaagacn ttgggcttgg ttggcacagc ntgacccttc 480 

tgcccatctt naaggcagnc ccggaantgg gaaaaatatt tctttaaatg gtggcctttn 540 

nttttttttt nctttnaaag gggttgaagt tccannaatg natttcccaa tttccttccc 600 

gaattgggnc ccaaagggcc ccaatggggc antcggtcct ttaaaaagna acctttttgg 660 

acctgggaag aagaaaatca cccagattgt tgggaaatat tttggncatt aaaataaant 720 

aatggaaaac ctnaaaaaaa aaaaaaaaaa aaaactcgag cccnttaaaa acttttagtg 780 

agtcnnatta ccnttanatc canacnttga tangaanctt tggataattt tgggncaaac 840 
cnnaacttng at 

852 

<210> 4i6 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 416 

ggnnnnnnng tnaaaccttc cnaannaggc tnggcgtcac tgnccccggt caacaaaccc 60 

acttttatga cagttttctt ccgcagcttg gctnttaaat tttactggca ggtgtatggt 120 

tgttggaggg ttcctagtga gttgggggac ctggcantan agctgcttgg ttggaggaag 180 

tgaanctggc ttantaccag cagctgatct cttccacgtg ctgctgcttt ttttgccact 240 

ctgatactaa accagagaaa gctgcaggtg gataaagaag ctgtggctgt tttttgcttt 300 

tgggtggcaa tgagaaagag tcacagtgtg ggttaaaggg atctgcagtg gggccaagga 360 

tgccacccca ccctcagctg tangcaagct tgcacataaa taacccccgt cagtggagtg 420 

ttcgggatgc agggggcant atagtgttct tggactttgt ccgtcctggg gcagttttta 480 

agttctttat atttaagtgg ggtcagtgcc aagtgctacc actttcccaa taaangaatg 540 

ggggacccan aaggctgggg tccctggcta ccttgttatg aaggttttgn tntttctctg 600 

acaaganttg ctttggaaag ancctgtttt taggggatta ttttttgnat accccgatgg 660 

gganccaggg ttctnctcaa aacccttaca acccttagga tcatagggaa aaggggcccn 720 

tnttttnctg ctggcttncc caacttaaaa acnt 754 

<210> 417 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 417 

ngtntatagc ttnctaatgc ttcntancga attcggancg agagaagccn tgagcagcaa 60 

agtctntcgc gacaccctgt acgaggcggt gcgggaagtc ctgcacggga nccagcgcaa 120 

gcgccgcaag ttcctggaaa cggtggagtt gcagatcagc ttgaagaact ntgatcccca 180 

naaggacaag cgcttttcgg gcaccgtcag gcttaagtcc actccccgcc ctaagttctc 240 

tgtgtgtgtc ctgggggacc agcagcactg tgacgaggct aaggccgtgg atatccccca 300 

catggacatc gaggcgctga aaaaactcaa caggaataaa aactggtcaa gaagcttggc 360 

caagaagtat gatgcgtttt tggcctcaga gtcttttgat caagcagatt ccacgaatcc 420 

tcggcccagg tttaaataag gcaggaaagt tccctttcct gtnacacaca acgaaacatg 480 
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gtggccaaag tggatgangt gaagtncaca atcaagttnc aaatgaagaa ggtgttatgt 54 0 

ctggctgtan cttgttggtc acgttgaaga tgacnngacg atgaancttg gggtataaca 600 
ttcacctggc tgtcaacttc ttggnggtca attgcntcaa agaaaaaact tgggcagaaa 
tgttccnggc cttatnttnt caagaaccnc catggggcna agccccaacg ccctttnttt 
aaaggcncat ttggaattaa attcntnttt ncccg 



<210> 420 
<211> 739 
<212> DNA 
<213> Homo sapiens 



660 
720 
755 



<210> 418 
<211> 757 
<212> DNA 
<213> Homo sapiens 

<400> 418 

tggggnntnn nttctaatgc tgggatgttc taaangntgg gctactcgtt ctttccgcag 60 

gancccntcg attcgaattc ggcacgagga aaggtggcgc gcttctcacg gctgagttgc 120 

tgcgcctgca gacggaagct ccccacaggc agagctgctt ggatgtgtga gtcatgaagc 180 

cagagaagcc ccgctccatg agcagtgact ccccaggccc tgtgacctcc ctcctgtctt 240 

gcagctcctc ctggcaccag tccccagggc tctcctgttg gtagttcctg cttttcttct 300 

tggaaattcc tcgtggacct cgagatcttt accctaaaat agttctgttg aatttcaccc 360 

tggcaatgta aattgatagc ttatcttcac agatgccaga caatggacaa ctcaccatca 420 

gtcctctgct cacctgagac aaatgcatgt ctgattgctt cctctgccct attgnttatg 480 

tgaaaatgca gattcactga gccagactaa ggcatcagtg actgttcctc tactgcctct 540 

cacatggaga ttgtgtattc agtgaaaggc tgatcaaaga ccccaaagga atgcaccagt 600 

ttatctctta tctacctatg acctgcgagc tgnccaccac ccccagttgt tgcgcctttc 660 

cagacagaac cagtgtcatc ttacacgtat taattggatg tcctgngnct tccttaatat 720 
gtatcaaaac aagctngcct tgaacacctt gggcacn 

<210> 419 
<211> 738 
<212> DNA 
<213> Homo sapiens 



757 



<400> 419 

gnnngncgtt cnaattncgn ggnntctttc tngccnanna nnannngcgt gngngaattc 60 

ggcacgagac tgttcatcct aagttccact ataaacaggc tcatgactcg ggcacagaca 120 

cttcttgcgt gactttttcc tatgatggta atgtccttgc ctctcgtgga ggtgacgatt 180 

cattaaaatt atgggacatc cgacaattta ataaaccact tttttcagcc tcgggtcttc 24 0 

ccaccatgtt cccaatgact gactgctgtt tcagtccaga tgataagctc atagtcactg 300 

gtacatctat tcaaagagga tgtggcagcg gcaaacttgt tttctttgag cgtaggactt 360 

tccaaagggt gtatgaaata gacatcacag atgcgagtgt tgttcgctgc ctgtggcatc 420 

caaagctgaa ccagatcatg gttggaactg gaaatggatt ggctaaagtc tattacgacc 480 

ccaacaagag tcagagggga gcaaaattat gtgtggttaa aacccancgg aaggcaaaac 540 

aagctgagac tctactcagg actacatcat cacccctcat gccttgccta tgttcccgtg 600 

agccccgnca acggagtaca aaggaaacag ctggagaagg acagactgga tccctgaagt 660 

cgcattaacc tgaacctcct gtancangcc cangtcgtgg tggccgattt ggaacccacg 720 

ggggcactnt tttttcct 738 
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<400> 420 

gcgntnntat tagcgtgggc tcgntctcgc tcnacncanc nngngctggn cgaattcggt 60 

acgagaatca gaggaggctt cttcatcctt caactccatg atgaactcct atatgaagtg 120 

gcagaagaag atgttgttca ggtagctcag attgtcaaga atgaaatgga aagtgctgta 180 

aaactgtctg tgaaattgaa agtgaaagtg aaaataggcg ccagctgggg agagctaaag 240 

gactttgatg tgtaactgtg ctgttgatga agtcctccca gggaagcctg tgcagatgca 300 

gtcacctgga aagaacagag attccctttc acctacctca gcaaaacaaa ctttcaagtc 360 

ttgatagact tagcctagta attttatagt gagagtttca aactatatat caagtgtcta 420 

tagcatcaaa aacttctggg ggcgtggggg aaagtagaat accaagtata atagttacat 480 

tcactttcaa agagcatcta tgaatttgcc ttttgtaact tactgtggct ttaaacatat 540 

tcagaacaga tgcttgaaat atgcacttag cactttggtt ccacatctgt ctgggtaaac 600 

catgaagaaa atgaagctgc tgcctcaatc gancccagac agcagccata ggcagataaa 660 

gatttnggtt cacccttggt ggtgggaggc atcgtgtgtg cctttttttc ctctaatatc 720 
aattttacag tccgggaan 



<210> 421 
<211> 727 
^.212 > Dim 
<213> Homo sapiens 



<210> 422 
<211> 753 
<212> DNA 

<213> Homo sapiens 



739 



<400> 421 

gtgatctttn tgagtggggg ccntnctngc tctannanat aggttnggng ggctagcgat 
ttctacctgc gctactacgt agggcacaag ggcaagtttg ggcacgagtt tctggagttc 

gaatttcggc ccggacggaa agcttagata tgccaacaac agcaattaca aaaatgatgt 180 

gatgatcaga aaagagctta tgtgcacaag agtgtaatgg aagaactgaa gagaattatt 240 

gatgacagtg aaattacaaa agaagatgat gctttgtggc ctcccctgat agggttggcc 300 

gacaggagct tgaaattgta attggagatg agcacatatc ttttaccaca tcaaaaatag 360 

gttctcttat tgatgtaaat caagtcaaag gatcctgaag gccttcgagt attttactat 420 

ttggtacaag acttgaaatg tttagttttc agtcttattg gattacactt caagattaaa 480 

ccaatttaaa ttgtatgttt tcaagctggt tgnatattta attaaaggga tgggaagggg 540 

ttatttgtca tttacagtat tggggtttta tgaatgtgaa gcaaccaaaa aaaattnnaa 600 

tgtaaaactg gaaaatagga aaattcatta ncagcttaat gggtatcctt acttgatncn 660 

ctgggtttgg aagtccccac acacattaaa tctgtaatga aancnctttt ggttaaaatt 720 
tctctat 



60 
120 



727 



<400> 422 

gtntngnnng nngttnnatt atatggntcg nctnnctcna nnancnangc ttgngctgac 60 

aacttgattg ggttctcctt caggtttgaa gcgccctcna gaagtgtcta aaggagacag 12 0 

ttgatagcca aacaacagtt ttggattcac tgactgatta tgaaagaagc agtagactgg 180 

tatcaagaat cagtcagcaa ggaggccctc accagacgcc agtgccatgt tcttggactt 240 

ctcagcctcc atattcatga actaagtttt tggaatcctt aggcttccac gtgtggaaag 300 

cctgagctaa cctactggag gatgagccat cacctggagc agattcaggc catcctagtt 360 

gaagcctccc taggccaagc aaccgtccaa ctaccagaca ttgaccattc agccttgaac 420 

attcagcaca aagacaaaac agaccagacc agaagagtcc cacagaatag gggaaactat 480 

tcagagaaaa cttaagccac taagttttat ggtgttttgt tcttgtagcc agaagcatag 540 
gcatactggc caatacaaac cgaaatcctt ctaacgtant ggaccctttt caggccagca 



600 
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ttttttccct tgaaaacctg ggagccttgt attccatctt attagcagaa gatcactttc 
accaatggtt tgggctcttg atttggaatt gatgatgtaa tgagcctnta ttcnanatgn 
gacttaatac ctctgcgaat tgactggatt ccn 



accn 



<210> 424 
<211> 799 
<212> DNA 

<213> Homo sapiens 



<400> 424 

ggagnnnngn ntccnaattn nntgggnnnn nnngtcaaan nctngctact cgttctttcc 
gcaggatccc atgcgattcg aattcggcac gagcccagac ctatggagtc agacagtagg 
tttgaggccc agcaatctat ggtttaacaa gccatccagg tgtttctgat gcacagtgaa 
attggggtac cactggtatt aggtttggta tggcaacttt ttcatcactt gttttatgta 
gttgtctgat caattgtgaa aacataatga atgttggaaa tggaacagta aaataacgaa 
agccaacttt tttttttttt tttgagacgg agtcttgctc tgtcgcccag gctggagtgc 
agtggcgcga tctcggctca ctgcaagctc cgcctcctgg gttcacgcca ttctcctgcc 
tcagcctccc gagtagctgg gactacaggc gcccgncacc acgcccggct aattttttgn 
atttttagta gagacggggt ttcaccgtgt tagccaggat ggtctcgatc tcctgacctc 
gtgatccacc cgcctngggc ttccaaagtg ctgggattac aggcgtgagc caccgggccc 
gggccaaaag ccaactcttt atgcctagaa aatattgtgc accctatgac ccaagcccat 
tgaatttttn cngggaaatt tatggtaaat tattgaaatg gatggtacct ttaaaaagtt 
atttggcaca ttccccttgg gttacctttg gnatggtttg ccagggaatt naaaactttg 
ggntnaaacc ttttttann 



<210> 425 
<211> 750 
<212> DNA 
<213> Homo sapiens 



660 
720 
753 



<210> 423 
<211> 844 
<212> DNA 
<213> Homo sapiens 

<400> 423 

nggnnnttnn nnnnnatncc ntgatcgtgt ntcgttcttt ctncaggatn nrmtcgtttc 
gaattcggca cgaggaaaag ggagccgcgc agngcctacg ggagtncggc ggcagcagcc 
ggtaccggca accacgggca gctctcaggg aatctccgtc gttgaggcca naggctccag 
tccccgcgag tccagatgcc tgtccagcct ccaagcaaag acacagaaga gatggaagca 240 
gagggtgatt ctgctgctga gatgaatggg gaggaggaag agagtgagga ggagcgganc 
ggcagccaga cagagtcaga agaggagagc tccgagatgg atgatgagga ctatgagcga 
cgccgcancn agtgtttcag tnagatgctg gacctggaga agcagttctc ggaagctaaa 
nggagaagtt gttcaaggga acgacttgan tcanctgccg gnttgcggct tggaaggaaa 
ntgggggggc ttgaanaaga agcccctgga atnccaccgg aagccccctt ttgggggggg 
gccttgcaaa ccgggaancc ctttnaaagg aatttcngcc antttcaang gttgggccaa 
ggggaatcnt accnaagggg ccttctnggc cttggnatgg tgaatccang gnaaattaag 660 
gtncccaatt gntgaancct tccaanggga ancccaaacc agcacccttg naanaagttg 720 
agaaaacttg cttgcntctt ntgacacccc tncnaggggg aacttcaagg aaccggttcc 780 
tnaggcttgg aaggaggacc cccananccc tggancctaa attnttaaat gggtnggacc 



60 
120 
180 



300 
360 
420 
480 
540 
600 



840 
844 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
799 
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<400> 425 

gangccggat tccaattntc nggctnctct naaannctgt ntaatgcttg gtccgcanga 60 

ncccatgcga ttcgtggagg tctcctttcg ccccagccca ggtggccaag cccatcctgg 120 

cctcagaaca tgctgagcac attttgtagg gtggcacctt tttatccaag ttactagcta 180 

cacatcagtg tttaaagaga aaaaagtgac ctttcatttt tttttcttga aacttgagga 240 

aacaagatac atactactga tttttttttt cttaaaacta aatgcatgac tgcagagcgg 300 

tagaggtgta tatttttcat actgtggggc aaagtatttg tgctgctttt tggagatgga 360 

ctggaacgtc tggtttctgt ccccgggccc ggcagctacg tctattttct gtagaaggtg 420 

ccacagtgag acctggagcc accccttcct gcctggcgcc gtttanagct gggagcccgt 480 

ggactccggc ctgtttctac cttctattca accactctga cgtggggaga caagaagaaa 540 

tagaactttt tgatagtgtg gtaaaaacat tggattttga actattttag taaaaggagt 600 

taccaacaag aatgtnatag gtgctacttt gagctagata aataaaggct ctttgtgagc 660 

ctcctgaaaa aaaaaaantt nnnnnnnnnn atnannnnnn annaaaaaaa ctggnccttt 720 

aaaactttan gggncgttta cctanaccct 750 

<210> 426 
<211> 819 
<212> UNA 
<213> Homo sapiens 

<400> 426 

gnagnncggn ttcttatgat cgtggctnct cntctanngg ttgtgtaatg ctnggtcnnc 60 

angannnnnt gcganncgaa ttcggcacga aggggggttc ccaatagtag aaaagggtcc 120 

ccattcotgc tcagcaccgc acctctctac ccccccacag acacacatgc agacacacac 180 

atgcagacaa cacgcagaca cacacatgca ggcactoaca tgoaggccca tgcacacaca 240 

cgtgcacaca catgcagaga catgcagaca cgcaggcaca catgcacaca tgcaaagaca 300 

cgcatgcagg cacacgcaga cgcacacaga gacacacatg cagatcacat gcacacacac 360 

atacacacac tggcccctgt ttttctgtgg tgtcactggg tgccagcaac toggtatctn 420 

ccaccttcca ctaaaacctg ggccttaatt tctctcccgt ccccacccct aaattcctga 480 

tggatgaacc tagagctgtc ctgtccactc caggccggac tgacgtancc tatgggccca 540 

gcaggtccag ggcccacgtt ttaatttctt tttnaaaagc tttaggtctt ggccnggccg 600 

ccggtggttc acgccttggg agttcccagc atttttnggg aaggccnaag gccgggttgg 660 

attcacaaag gtcaagcaag tttcaaggaa ccaagccttg aaccaggcca ttgggtgagg 720 

aaccctgggc ttnttactng ggnaaattcc caaaaaaaaa ttggccttgg gccnaagggt 780 

gggcaagggc acccttgttg gggtccccaa antttacct 819 

<210> 427 
<211> 750 
<212> DNA 
<213> Homo sapiens 

<400> 427 

gagnnngatt cnaattnctg ggctnctctc ttnntatnta atgctgggtc cgcangancc 
nntgcgattc gaattcggca cgaggtccaa ggacaacttc gagacatttc tttttgccac 
cgtatctaac agggagcagg aagatctctg ccgaggaatt gtccagctct gcttcaatga 

gcaaagccaa cagctgctag cagaggtcca gccctctgac tctttcctca tggtagagac 240 

aactgcatac tttgaggcct acaggcacgt cctggaagga ctccaggagg tccaggagga 300 

agatgttccc ttccagagga atatcgtgga gtgtaactct catgtgaagg agccaaggta 360 

cttgctaatg gggggcagat atgactttac ccccttaata gagaatcctt cagccactgg 420 

ggaatttcta agaaatgtcg agggtttgag acatcccaga attaatgtct tagatcctgg 480 

ccagtggccc tcaaaagaag ccctgaactg gatgactcca gatggaagcc ttgcagtttg 540 
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ctctcacaag ggaactggct attattcaag gaccttctgg aacaggcnaa acctatgtgg 
gtctnaaaaa ttgttcaagc ccttctacca acgagtcttg tttggcaaaa ttaaccttca 
gaaattccca tcttggttgn gtgtatacta atcatgcttt ggaccanttc tggaangctt 
ttccattgtc agaaaaccan atttggccgg 

<210> 428 
<211> 943 
<212> DNA 
<213> Homo sapiens 

<400> 428 

gnngnccggt ttcctattct cnggcanctc tcttcctncn acctattanc tggactctaa 
anaaaagnnt gnngcggttg gctcaagggc caccanaaca tttctttatt attattattt 
tttaacctgn acatgcntta aagggtctat tacctttctt tccgtctgtc tcaacagctg 
aaatggggcc nccaaggagt gccttccttt tgctccctcc tactgggact gacggntggn 
antgtntgnn cccarmtggg ggtgtctcct gnctgggaag ganggaaagg gaggcanagt 
tttgccgggg ttgcanntng acancangct gnanaggana tggctaataa ctgtttaatg 
gaaacctgct tgggcttgga nggaacttag nctgaatttt cccgacttcc tctgccagtt 
attgacacan tctctttnta agacangaaa taaactaaac cccaccccaa ggnantnatn 
ncangcngaa aacnncncat ngcccacatt ncctnatccc ntancaccnn ctcnttnttt 
nncccaanac tncttcccan ntntcnccnt ttacccntan ncntnnttnt atcccnctaa 
tncctnannn ccntnnttnt ccnnatnctt acncncncnn ntnnnncccn nntctttntm 
cccaaanctn ncctcncnnt tcnnctnaac cntntnnnca nnanacaccc ttctnatnnc 
ccannntctn cacnntnnnt ntctccnnnt nnncnccnnn ntcntnnnna nancntntnn 
nanancnatc tnntncncnn cnantnnnnt tcanttcacn ctctnnnnnn tntanccnat 
tnnccntcnn tnnccnntta nnncntnnnn nncnaantnn nnnnanctct ncnnncnnct 
canntnnnnt nnnnnncnnt cnanaccntn nnnntctatn ccc 

<210> 429 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<400> 429 

gnangnnnnn nntttctaan tncttgggnn nnnngtcann gattnngcta aaggttngga 
tcncncgcag naangctgtg gcgctccatt gtgaaagatc caggcatttt tccgagccag 
gaaaagccca agatgactac aggatattag tgcatgcacc ccaccctcct ctcagtgtgg 
tacgcagatt tgcccatctc ttgaatcaaa gccagcaaga cttctctgct gctgtgatct 
gcacaccctc caacctgggc agggactggg gggatgcagt gtgtgttagt gcccatgtgg 
cattgtggca ctgttgcccc ccatggcggc atgggcaaga tgaccttcca ttagcttcaa 
gtcttgttct cttgtctgtg gtctgtttaa tatgtgggtc actagggtat ttattctttc 
tcccatcctt acactctgga tcattgtgca gacttaatca gggttttaac gctttcattn 
nnnnnnimnt ttttttgagc tcaaagaaag ttctcatttt ccctattcaa ctaataccca 
tgccgngttt tttaccttgg atttaaaggc accttangtt ggggcaacag attctcactc 
atgtttaana cctggnattc ancttcataa gaccaaagan ggagctttcc ctttctcttt 
acccctnagg attctcatcc tttacanntn gactttttcc aggccaattt cccatnnaat 
ctgcnanncc cngccttttg ncccaagctt ttntgntngn ccccccattt acccn 

<210> 430 
<211> 763 
<212> DNA 
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<213> Homo sapiens 



<400> 430 



ngggtgnnnn nntttctaat nctggggnnc nntnnncnnn ntttcctaat ncttaggngc 60 

tcgttctttc tccangcagn nnngcgtttc gcgacagctc tccaatactc aggttaatgc 120 

tgaaaaatca tccaagacag ttattgcaag agtttaattt ttgaaaactg gctactgctc 180 

tgtgtttaca gacgtgtgca gttgtaggca tgtagctaca ggacattttt aagggcccag 240 

gatcgttttt tcccagggca agcagaagag aaaatgttgt atatgtcttt tacccggcac 300 

attccccttg cctaaataca agggctggag tctgcacggg acctattaga gtattttcca 360 

caatgatgat gatttcagca gggatgacgt catcatcaca ttcagggcta ttttttcccc 420 

cacaaaccca agggcagggg ccactcttag ctaaatccct ccccgtgact gcaatagaac 480 

cctctgggga gctcangaag gggtgtgctg agttctataa tataagctgc catatatttt 540 

gtagacaagt atggctcctc cgtatctcct cttcctagga gaggagtgtg aacaaggagc 600 

ttagataaga caccccttaa acccattccc ttttccagga gacctaccct tcacaggcac 660 

aggtccccaa atgagaagtc tgctacctca tttctcatct ttttactaaa ctcaaangca 720 

ntgacagcag tcagggacag acattcattt cttnatacct tec 763 



<^XU> 431 
<211> 761 
<212> DNA 
<213> Homo sapiens 



<400> 431 

tggtgttnnn ntcctaatgc ttggnngnnn ggtnannctt ctaattactt tggggctcgt 
tctntctcna ennngenngg cgttncgaat teggcacgag ettgaagege tggtttttct 
cgaagcaatc cttattatat tgttaaacaa ggaaagatca accagatggc aacagcacca 
gattctcaga gattaaagct attaagagaa gtagctggta ctagagtgta tgacgaacga 240 
aaggaagaaa gcatctcctt aatgaaagaa acagagggca aacgggaaaa aatcaatgag 300 
ttgttaaaat acattgaaga gagattacat actctagagg aagaaaagga agaactagct 
cagtatcaga agtgggataa aatgagacga gccctggaat ataccattta caatcaggaa 
cttaacgaga ctcgtgccaa acttgatgag etttctgeta agegagagae tagtggagaa 
aaatccagac aattaagaga tgctcancag gatgeaagag ataaaatgga ggatatcgaa 
cgccaagtta gagaattgaa aacaaaaatt tcagctatga aagaagaaaa agaacagctt 
aatgctgaaa gacaagaacn gattaagcag aggactaant tggagcttaa ageccaagat 660 
ttacaagatg aactaccggc aatagtgaac aaaggaaacc gttttttaaa agaaangcen 
aanctgcttg aaaaaaaaaa aaaaaaactc ggectntaan t 



<210> 432 
<211> 748 
<212> DNA 
<213> Homo sapiens 



<400> 432 

gnngantnng tcttattatc gtggngctct nactnnctct aaatanaatt gtgttgnggg 
aatteggcac gaggccaccg aagcttcagg atgacatctt agactctctt ggtcagggga 
tcaatgagtt aaagactgea gaacaaatca acgagcatgt ttcaggcccc tttgtgcagt 
tctttgtcaa gattgtgggc cattatgett cctatatcaa gegggaagea aatgggcaag 
gccacttcca agaaagatcc ttctgtaagg ctctgacctc caagaccaac cgccgatttg 
tgaagaagtt tgtgaagaca cagctcttct cacttttcat ccaggaagcc gagaagagca 
agaatcctcc tgcaggctat ttccaacaga aaatacttga atatgaggaa cagaagaaac 
agaagaaacc aagggaaaaa actgtgaaat aagagctgtg gtgaataaga atgactagag 
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540 
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ctacacacca tttctggact tcagcccctg ccagtgtggc aggatcagca -actgtcag 
cttccaaaat ccatatcctc actctgagtc ttggtatcca ggtatttgtt tcaaactggt 
gtctgagatt tggatccctg gnattggatt tcttaaggac ttttggangg ^cttgacac 
catgc^cac agLcttggg cttcanaagc ttcanttttt tgcanaggtg ccccaggtta 
ggaaaacagt tntncttgtt ttgtannt 

<210> 433 
<211> 769 
<212> DNA 
<213> Homo sapiens 

gggnaaaagrtt^nannng ggnagnnnng ntnnaccntt cctattactt tggagctcga 
acLgcncca canannnagt gncntgngct gttttgcaga tgaggaaaac tgaggtacag 
aattcttagg gaacttaccc aaaatggctt ttctgcactc tgccctttgg tattgtccca 
tgtgaattg? ttaaaactta tgtgtatagt ggcatgagta ggtgatttca gaaacagaac 
tcacttttgt tgtttggtct taaaattagg aacttttctt catctgggct tcatttccct 
gcaccttccc agctttctag tcatgcaagc cacatgtctc cacgtgaggg gttcattgga 
aagcagccac agagccacc? cctggctggg ttcttcccca gctctgcttc ctccttcccc 
aagtcctgca gctgctctct ccatggcaga accacttctc cccttactgg aggggaggtc 
cactgaacaa atccaggaga ggaatcattg tgttttccac agaagagaaa gtaoactgga 
ctttctgtgc aacctgttac tacattttca caganactca f tttgtge. ntgt^ctc. 
atttgaaacc cagcaaaatt aggctcccgt gtctccataa aaggccacca tgatggtaac 
cgttggactt caccttgtgt ttnggacana ngctgattgg attttaccca tcatcacanc 
cgtgtcttac attotcnttt cctgggcttt ggacccctgn tanaaaaan 

<210> 434 
<211> 764 
<212> DNA 
<213> Homo sapiens 

ctancct^rtaaannctng gctactcgnt ctttctnnan ganncnnntg ^ n ^aat 60 
tcggcacgag caccttgcct ggccaagggg ctagacctcc caggctaagc ctcagattca 120 
gtlcaggac! caagctcatg cccccgtctt gccagtgaca cttgaagcct ccogacttec 
acaga^gct tcaggacaca ttttgagtgg tattttcttt tctttttttc ttcttttttt 
itt?4agat ggagtctcgt tctgttgccc aggctggagt gcagtggcct gatctcggct 
cactgcalcc tctgcctccc aggttcaagc gattcttctg cctcagcctc cagagtagct 
gggactatag acatgcacca ccacgcccgg ctaattttgt atttttggtc gigacg^ 
SSgccatgS tagtcangct ggtcttgaac tnctgacctc aagtgatcca ccactcggcc 
tccaaagtlt tgagatgaca ggcacgagcc accagcccaa cctgagtggt Jttttcttt. 
gggaccangt agactttaaa acgagggtaa gagaaaaagc ccagtggtct tttc tgangg 
taaataaatt tctgcccagg aaacnttncc aagccccaac cagcaagcca acccttaaaa 
aaaaaatcac ttcgtgttcc ccaangggan ctttnttaaa gctttggggg cttccaggrxa 
aaatcatttc cagtnnaant ttggaagaat tcannagnat ttnt 

<210> 435 
<211> 755 
<212> DNA 
<213> Homo sapiens 
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<400> 435 

gnnnnntttc taatgtggnn nngnnngnta annttctaaa ncttgggntc tcattctttc 

a^tSc C c catff ^ attC ^ -W-tcctt tccaglca^ ag^cct^ 
aaatctgacc catttaaagg agctgacccc ttcaaaggog acccgttcca gaatgacccc 
-jc-g-cc ttcaacagat ocatttggag gggacccttt caaagaaagt 
gacccattcc gtggctctgc cactgacgac ttcttcaaga aaoagacaaa gaatgJccca 
tttacctcgg atccattcac gaaaaaccct tccttacctt cgaagctcga cccctttgaa 
tccagtgatc ccttttcatc ctccagtgtc tcctcaaaag gatcagatcc ctttaoalcc 
ttagatccct tcggaagtgg gtccttcaat agtgctgaag gctttgccga cttcagccag 
atgtccaagg gtgcctgggg aagagccact gcgcatgtta tcttt^gtgt tactccagta 
VSZT? a * ct ^ tc ^ aggcagtgca tcgcanagag acattStaa gggaa'ct? 
tgaatcccta ancagcanca gctttnctga nggggccnat gatgccagtg acctottcan sso 
ggnaagtctg ggacattggg accaccctgg ggggaagaac ttg^gggatg tggcttttct 720 
tttatgaata aagtactttg agttggttgn aatcn tggcttttct 720 



<211> 760 
<212> DNA 

<213> Homo sapiens 
<400> 436 



JagSaSc^ ZT^ attantctgg gggctcgtnc tctctcnann 

nagnnaggcg ntgngaattc ggcacgagct caagaaaagg agaaagtttt tttataw™* 
attggaggaa atattgggga acgctgcctt gatgatgaca c^tacltgaa ggatttaSt 
cagcttaacc caaatgctga gtgggttata aagtcaaagc cattgtagaa Xcttaacaa 
gctgcagata accatgtgga cttctgtcat aattcttgct gagtcaagag Jgtaaataaa 
agaaatggca ggactcatat tattcagttg tcccaagtat ttaaaaatga SctcttaaJ 
tlttlT a3 tCat39attt ^gctgctgc cagaattata ttaattatta ttaatggta? 
llltT t* aaaatttct 9 ST^tgagagt naagangctt aattagtttg tgggcSttt 
fc ^ aaat9t gtCCa ^ at ^ ^ataagtt tttttSttt! aStg^gga 
aatgncttct ctttcccatt cttttctcct aaaaatcata tatactggga atatatgcct 
ctnttacctc tattaccctc ctcacattta ccctttccca gttngg^tt gctttttn^c 
caaaaagatt ccaatnccna ggtattggca agttntnaaa Lcgcccntt Xacatcclt 
aatttcncag nattccnnnc ttgccaaatn ttngtntcnn 

<210> 437 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 437 

ggnnnnngnn ngntnncgtt ccctattant caggngctcg ntctntctcn annnancnng 
gcgtgtncga attcggcacg aggattttcg aaactcttca gctacttgcc c^tS 
tgaaaccatc ataccttctg aaagaaaaaa gcatatcttc attgacaSa cagaagtgag 
atggcccagt cttgatacag atggtccatg atatatatgg agagtggcat tSaSta 

cattttttS Sill:?! 90 3taCCtCt ^ -tgcccagat cfcagcLga tlTeZtl 
cattttttgt gtaaagttag ccagaagtca gactcatctg cagtgttgtc agaaacaaat 
aataaagccc caaaagataa actagttgaa aaaactggca aaatctg^t acgtggaaat 
ttaccaggac agagactgaa gaataaagaa aatgagtttc attgccagat cZgllatcc 
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aaagaaactt taaagaagat gagttgtgta aatggaactg aagggagggg aagaactgcc 
ttcgcctggt acaaagcaca catgtgtata cacatgggtc aagcagtgct ggtctgtggc 
tgnctgtcca gangaatgga aatatccttg gctttagcac ttcattttca taataaaatc 
agcaattntg tctaaaaaaa aaaannnana aaaaactnga gcctntanaa ctntagtgag 
tcgtattacg tagatncnna catgataa 



<210> 438 
<211> 823 
<212> DNA 
<213> Homo sapiens 



<400> 438 

taatccttrm tattgntcgg gtactngntc tntctcnaag annntntcgt tncgcccagg 
tagctgagac tacccacacc ttggtcccag ctacttggga ggctgaggtg ggaaaatcac 
tttgcccagg aattcaaggc cgcagtgagc tatgattgca ccactgcact ccaggcaaca 
gagtgagacc ctgtcttaaa aaaagaaggg agaaagtgtc agatggtgat gaggtctggg 
ggggaaatag agaatgggga tcaggagtgt ggatggtggt attccctcac caagaggtga 
catgttgagc agggaacttg ggaggtgagg gtgtgacccg tgtggaaatc agggaaaagc 
attncagcct gagggacagc caatgcanag gccgtgaggt ggccagtgcc actgagcagt 
gagcttggga tagggggcan gtgangaggc tggagagcgg ggtcagacaa accaatatgc 
ttatttaaaa caaggttgtt ncagcaccct tgccttaaag ccttgagcct gnaanctnga 
aaaatttggg cacnttcaaa agcanggang gaaaccaaaa gaagattggg agggaaaagc 
ccttncnttc ccttancagg aaatgaagtt nccacccttn aaaacaggnc caggaccttt 
ttgggaccct tttggccttt tggttcctta gaatcctctt ggtngcttnn gaatnaaaag 
gnaaaagggg cctttaaggg gggatcccat tntttccaaa attcaaaggg ggctttccct 
gggcttaccc aaaatttctt ggncttaant aaaaaaattt ntt 



<210> 439 
<211> 767 
<212> DNA 
<213> Homo sapiens 



<400> 439 ^ 
gnnnnngntt ctaatgctgg nnnnnnnngg taccctttcc aaaacctggg ctctcgntct 
ttctncangn agccnngcga ttcgtctgtc tggtgatttt tattttaagt gaacctttgg 
atctatcttt aactctcttt attgtgagtc taaattccaa ttctgcagca gatcagtaaa 
ctcacagtat ttttcctgtg gaaatctatt caataaggaa accaagacag gatantaaaa 
tttaaaaaaa ancaactttg aattcccctg cctaggtctt ccagttgttt tccagcgcat 
acctcaggta tgactttgct agccggggac aaaattagca ccttccgatt ctctagtcca 
aatgaacttt ggctaaataa aaaattatta tactacataa taaagttnca gatagcagga 
aatgcaagag ctaggagatt cctagattat atctggccaa gccaaatacc ttaaacatcc 
acctggaaat cctctacccc ctcttctgag ataatttgcc cagccctttc ttcccacaca 
ctcactcaat gtcaccccct tctaatcccc aaaactgttt ttgtggcctt ggtagcctat 
agtagtttct cacatctttt cccctanact tttctgtttt cagtttcaga ccaaaaaaac 
tcttcaactt ttttccagtg gggtcttcct taccagtaac tttaccactt gnaatcttat 
ttcattgaaa aaaccttaaa tgggntggga aaaggcttgc cnncann 



<210> 440 
<211> 752 
<212> DNA 
<213> Homo sapiens 
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<400> 440 



nagnnnnntt tctaatgctt ggnnnnnnnn tcnatgcttc caaaagcngg gngctcgttc 
tttctccaag atncnngcgn tncgaattcg gcacgaggat ggatgagact gttgctgagt 
tcatcaagag gaccatcttg aaaatcccca tgaatgaact gacaacaatc ctgaaggcct 
gggatttttt gtctgaaaat caactgcaga ctgtaaattt ccgacagaga aaggaatctg 24 0 

tagttcagca cttgatccat ctgtgtgagg aaaagcgtgc aagtatcagt gatgctgcco 300 
tgttagacat catttatatg caattcatca gcaccagaaa gtttgggatg ttttteagat 360 
gagtaaagga ccaggtgaag atgttgacct ttttgatatg aaacaattta aaaatcgttc 420 
aagaaaattc ttcagagagc attaaaaaat gtgacagtca gcttcagaga aactgaggag 480 
aatgcagtct ggattcgaat tcctggggaa cacagtacac aaagccaaac cagtacaaac 540 
ctcctacgtg gtgtctactc ccagactncg tacgccttca cgtnctcctn catgctgang 



<210> 441 
<211> 775 
<212> DNA 

<213> Homo sapiens 



<400> 441 

gnagccngat tccaaaacct gggnnccgat ccaatgcttn ccaattactt gggagctcnn 
actngcncna ncaanctngc cntgcgaatt cggcacgaga agnaggcgga gcttgcagtg 
agctgagatc gcgccactgc actccagcct gggcaacaga gtgagactct gtctcaaaaa 
aaaaaaaaaa aaatggaacg cagggcaaga actcgtnttt ggaaggagat gggggaaagg 
ancggtatta tacctatgtt gnatttgcag gcaaatgaga tgganccctc tctgtaaaga 
agagtcattt gtgcaagtag acggggtctg tgggtgcang ccctggaggg gcacaoaatt 
gcctgnangc ttctgtgana tcgggagang gaggagaagc agtctcttga caaaataaag 
tatttttatt cattngtatt tattaaatga aaaaacaatc ccatggtgtc ccttgtgtgt 
ggtggaacct aatgactgtt gaaataaagt ctgngtttte ccttcaaaaa aaaaacncnn 
anaanaaaaa ctcgagccct ntaaaacctn tngngagtcc gnattaccnt anatcccnga 
onttgataag gatccattga tnaantttgn cccaacccca actnngaatg ccnngaaaaa 
aaattgcttt atttgggaaa tttgcnaatn ctttgcttta ntttgnaccc antttanont 
cannnnccaa gttacnancn ncaattgcnt tcatttangg ttcaaggttc aaggg 



<210> 442 
<211> 804 
<212> DNA 
<213> Homo sapiens 



<400> 442 

gagnnngntt ctatacctgg gnncgatcca aancttncct attaccttgg atcttnngct 
atctcnaann aaaangcttn cgaattcggc acgaggccac ctgcactgag gtctgggccc 
ggggacaggg tgctttagcc aggcttgtct gcgcctcagg gaagggtgag cagcccaggg 
accagatgca agttggtggg cccctccacc cntccnacgc cactccccag tgtgctgggt 
cctaaccagt cgtcctatgg gagcagtcag ccttcctctc ctcctcaggg cagctctccc 
acctgctgnt ccccgcacao agaacctcat tgctctgagc agttgcttat tacccagttg 
ttgaaaaact agcatgtgan ggccgggcgc ggtggctcac gcctgtaatc ccaacgcttt 
gggangccaa ngcgggtgga tcatgangtc aggagatcaa gaccatcctg gcttaacacn 
gtgaaaccct gtctctacta aaaatataaa aaagtancca ggcgtggtgg tggcccctgt 
agtnccaann tacttgggaa gctnangcan gaanaatggc ntgaacccaa gaaggaagaa 



60 
120 
180 



cgcaatacac cgcttcttgg gtcangaatt agaagctact gggaaaatct accttccgac 660 
agaagagatc attttagatn taccgaatga anaaagcttg cattagtgac attgaaaggg 720 
aaataaaaat tcctacagtc naaaaaaaaa at 



752 
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cnttgcantg aancttaaaa ttgcgcccac tggaatttca aaccttgggc cnanaanaat 
tgaagaatcc cgtcttaaga aaaaaggaaa aaantttncc nttntnaaag gcccggccac 
aantnggctt taacgccctg gtaaatnccc aancactttt tggggaaggc ccaaaggcaa 
ggccnggatt caattttnna aggg 



<210> 443 
<211> 786 
<212> DNA 
<213> Homo sapiens 



<400> 443 

gnagccggat cttattattg gcnncgnttt aatgctggct aatntntcgt aatncttggt 
LcccaL annnaggngg ggngaattcg gcacgagcac catttttatt «*atgctta 
cactcattta ttctgttttt gtaaaacagt ttcaagaatt taaaaatcct tccagttaat 
agagcttttg ttattatatt ataattttgt aaacccactt tgtttttccc actttaaagc 
clcagggtcg actcatggat gatacctcta ttgctgctgc atgatgttca agaccggccc 
ttggctgttg ttacagagat gttgggcaga gctatgcagg tgtttcattg ngaactctag 
ctttgatcat ggtaaaaagt taaccctttc tattttttaa tggatgttat accaactatt 
cagaggactc atacttcaaa aatattagga aaatctgtct tatagttctc taataaatat 
ctgaaatctc aagtacgaca tgaaagaatg tcagaccatt gntattggtg aaagtcattt 
gatgaatggn aaattctatg aaaagtaagt ggatttgcat ggattaatat cagggaaaat 
ttaagccttc ccaagtgtga ctgggccaaa gagagccaga tgcccccagt gcctgtgccc 
ataaagttcc cgaatccccc aatggggtct nttttcaaaa acttggncca gacccggaaa 
ataaaancat tcntcataaa ttcaannggg gncctcanga aacacnttcc cccancaacc 



cttngg 



<210> 444 
<211> 760 
<212> DNA 
<213> Homo sapiens 



<210> 445 
<211> 761 
<212> DNA 
<213> Homo sapiens 



660 
720 
780 
804 
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<400> 444 

gnagnccggt tcnnangcnt nggctnnatc caatgctggc taaagttcna ananctggca 60 
Snccaggan ncangcgttg cgaattcggc acgaggagga attacaggta ^caaattatg 120 
gagttggagg acagtatgaa ccccattttg actttgcacg gaaagatgag ccagatgctt 
tcaaalagS ggggacagga aatagaattg ctacatggct gtttnatatg agtgatgtgt 
ctgcaggagg agccactgtt tttcctgaag ttggagctag tgtttggccc aaaaaaggaa 
ctgctgtttt ctggtataat ctgttgccag tgggagaagg agattatagt acacggcatg 
cagcctgtcc agtgctagtt gcaacaaatg ggtatccaat aaatggctcc atgaacgtgg 
acaagaattc gaagaccttg tacgttgtca gaattggaat gacaaacagg cttccctttt 
tctcctatng Itgnactctt atgtgctgat atnccatttc ctagtcttaa ctttcaggag 
tttacaatng ctaacactnc atgatngatt cantcatgaa cctcatccat gttcatctgn 
ggcaattgct taccttgggg gntcttttaa aaagtaccac gaaatcatca tattgcatta 
aaacccttaa aagttctggt gggnatcaca gaagacaagg ccnaanttna aagnggagga 
attttattat ttaaaagaac cttttgggtn ggatnaaaan 



180 
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300 
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<400> 445 

tggtgccggt tcttantetg ngctctcgtc ttccttctta tacctgggca ncncttggcg 
gccccnaggn tcccangnag ccnngcngng ncngattcgg cacgagattc caaaggtttc 
aaagaacttg gtcataaata tgataatgag aagacaaagt atttatatta aaacagttta 
gtagccttca gttttgtgaa aatagttttc agcacagaaa ctgacttctt tagacaaagt 
imTt . f atg9t9ttt Sr<=ttctagga tatacacttt aaaagaactc aeJgtccJg 
tggtggtcat tgatggcctt tagtaaattg gagctgctta atcatattga tatctaattt 
lltr.^ CC l t^^ 9 tccttaa "* ccaacagtga agcactacag gaggcaactg 
ta?tttt?aa HT^T tgtg-tgtt ataaaattg^ cactcagatf 

IStnn I a ^ ta atgtt atataagatg atcatgtgat gtgtccaaac tatggtgaaa 
agtgccagtg gtagtaactg tgtaaagttt ctaattcaca acnttaattc ctttaaaatn 
cacanccttc tgcctctgna tttggaagtt gtcagtncaa ctcatcaaag aaaactgcct 
aatntnaaaa tcatattntg ggaataattt ccctcttttg tagtctgccc aagatcctta 
aagattggat ttttattact atttaaacca gtggattaat n gatcccta 



<210> 446 
<211> 770 
<2X2> UNA 
<213> Homo sapiens 

<400> 446 



tgggnrmgnn ccnaangcng gggannnggt ccccgttcca anactggaan ncttggcann 
ta^t"" 9 Cn ^ Ma racegggnga attcggcacg aggccccgct ccatgagcS 
tgactcccca gctcctcctg gcaccagtcc ccagggctct cctgttggta gttcctgctt 
ttcttcttgg aaattcctcg tggacctcga gatctttacc ctaaaaSgt Ltgttgaat 
ttcaccctgg caatgtaaat tgatagctta tcttcacaga tgccagacaa tggacaactc 
accatcagtc ctctgctcac ctgagacaaa tgcatgtctg attgcttcct cSccetatt 
ggntatgtga aaatgcagat tcactgagcc agactaaggc atcagtgact ggtcctctac 
cal^^r ^^ ggagatt gwfttcag tgaaaggctg atcaaagacc caaaggaatg 
caacagttta tctcttatct acctatgacc tgcganctgc caccaccccc agntggngcg 
cctttccaga cagaaccagt gtacatctta cacgtattaa atngatgtcc cnggggctcc 
cnaanangna tcaaacaagc ngggcctcga ccaccttggg cacatatccc nanggacatc 
annctggagg ct ngngncac tggcattggc cctnaccctn ggcaaaataa accttctaaa 
attggnaaaa aanaaanaan aaaaacctng nnccctntna naacnntacg 



<210> 447 
<211> 757 
<212> DNA 
<213> Homo sapiens 



<400> 447 

^^ ntt tnaan ^ ct ^ nngnnnggcn ccgttccaat gnctggganc nttggcaatc 
^ -angatccca tcgattcggt ctgatgcagg agaattgcta aaacccagga 
gggagaggtt ncattgagcc gagattgcgc cactgcactc tagcctgggc gacagagcaa 
gactccgtct cgaaagaaag aaagagaaag gaaattcccc agggaagtac ctcggcttat 
ttcataaaca ggtactgaag gaagcagagg catgtggagg acttccccac ctcgtgcagc 
^22°° gtggCatCtg a **tttctta tttcagagtc acccctttga tgaccttggc 
agtgaactgc agtcatctgt ttaggccttt ccatggccca cgtcaatgcc ggtatttSg 
tttgttgcac atttgatttc cttgttgttg gcatttagaa ggccccccgt ttcccagatc 
acaccacggg catggaccac agagattgca tcttgtgagt ctgtagaaat ggtcaaggcc 
ttgtcctctc ttaagtccag agctcangtt aatgcaaaat tttnccggnc atctgtgctg 
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aaatcccttt ggggaagctc ctggctggtt tcctgtaggt aggacagcta cacgtnctgc 
ccttSttgg SSttttca tgaagctcct gccatntacn aaacatgtct cccttcttga 
atcacatctc tggtattgna actctanaat cgcccgg 



660 
720 
757 



<210> 448 
<211> 770 
<212> DNA 
<213> Homo sapiens 

gggtgnnnng^ctaatgc ttggtngnnc nggnccnacn tttctaatgt tcggaanggc 60 

SLSactc gntctttctn cangnagccc ntcggtncga attcggcacg aggtgtcttc 120 

atcttaccca g^ggaaccta agaaattaaa ttctccagaa gaaactgctt ttcagacacc 180 

aaaatctagc Lgatgcctc ggccttcagt gccaccatta gttaaaacat «ctgttttc 240 

ttcaaaaSa tcLcacctg atgttgtgag cccatttggg accccatttg gctctagtgt 300 

aataaatcgg atggctggaa tttttgatgt aaacacctgc tatgggtcac cgcaaagtcc 360 

SagctaS aglagggggc caaga tt gtg gacatcagct t-tgatcagc -atgactga 4 

.tt!tct«t ccttctccat ctacctctat tagtgctgag g^aagacaa tgagacaacc 48 

cagtgtgatt tattcatgga ttcagaataa acgtgaacag attaagaatt tc "9"aaa s 

acgggtgctg ataatgtatt ttttcagtaa gcacccagag gcctncattc aggctgtttt 600 

ttcfgSgcc caaatgcata tttgggcatt agaaaggtct gtcgcactta ^tagcagcat 660 

cattttacag aggatagatt tggagttgtc cagacgacac taccagctat ccttaatact 720 
ttgttgacac ttgcaagang cagtcngaca agtactttaa cttcctcatg 

<210> 449 
<211> 792 
<212> DNA 
<213> Homo sapiens 

<400> 449 t - aJ ,t ac ttaa 60 

120 



ngagaaangt ttctaatgct ggnnnngnna gntcancctt tctaatgttc taatacttgg 
gaStcgaac tntcncnaca cagnnangcn ntgcgaattc ggcacgaggn 
atnacttgnt cncancggno tggcatcnac ncgncacacc tacntnagcg ^tgtagcg 
caatatncac ctnntnaaac ccnnnagtcc cagggctctg ccnnnnnact gntcaactga 
cnaac^cS ncSncncaa cntnnnntta ngccnctgnc tgnctctatg gcacctnncc 
ITclTcZn cntnacccnc tacgctcagg gctatataca atgggaacct tnccaacagt 
aanccntgga tctnaggnat ggcccttgnc tggcggatca cagccttnna 9^^cagn 
atcttgagga agacaccatt ccgtcocnga ttntgaccaa ncnctcggat gtgnctatgg 
gctcnattga ggnacaacaa ctnncactgc nnataggcca tcctcnnnan nctacacatg 
ngactttnon nnncatntna aatgnnnana tgtctctcnc aagcatcacc ^ctgtccct 
nLncntcnt ggaagacctt ctgnncaact ganctccttc ntgnrmcnnn ngattnttnc 
Scntncccc aatgnccttg tnnngnattt atnangggnt ttccaatttg 
ggSlatttca ntncccnccg nannctannn ncccatnaac cntcngngcc ttcttgnaac 
cttttnncct gg 



<210> 450 
<211> 848 
<212> DNA 

<213> Homo sapiens 
<400> 450 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
792 



114 



WO 99/33982 

PCT/US98/27610 



gnatgncccg atttccttaa tgatggggnn nnnxmgagcg anncttccga aanttccaat 

SSSS ?ctJt t a : t CtC nCt — ^aaggncgn gggctttgct ctctcca^c 
caagttgntc tctgttctag aaagcagatg tagtagacat ctactgttgt tgcctgaaca 
gaatcccttt gtcctttttt tgntaaaagt actcatccct aatattcatt g^nctgaJaa 
gactgaaaat acagaactca caccatgatc ggccgggaca atcagattat Sc^ttccnc 
agcaaacgga gatcganccg aaaagtggaa anatgagcnc ttctttggng ttggcatata 
llltl 9 T. 9aaagaaCtn tnattntttc tcttggactg caataaagtl ta^tgccta 
! ^ CCt * acac " ggaggnttgt ccacaatcgg ngaaataaag gcg^Iccgn 
tnon^ 9 3 aaaaaaanaa gnnnccngnn gaanacccac tnnnccannn nccnLccS 
nnganCOnnn ^nccgnnan naggccnnng cnntngcnnc nnng^^S 
nnnnnngggn aaacccnnnn gnnnnncnnn nnnrmnnncn nnnnannnnn r^Jn^Z 

alZSaSS ^T anrmnC CCCn — cnncnccnnn nggnaanncc nnnS^ 
annnngggnn imnncnannn ccnnnnnnnn cannncnncn cnnnnnggnn nnmmcnZ 
Z~ nCnn9n9nnn a --gngn nnnnccnnnn nnnnnncngg 8 40 

848 

<210> 451 
<21i> 765 
<212> DNA 

<213> Homo sapiens 



<400> 451 



SclSZ^ fc = Ctaaat3c "gggnnnnn nnngagngnn nttncnnagt ttcctaanta 
tccSl^r ^ tCgttCtnt =tncangcag nnnntgcgtn gncgaattcg gcacgagcat 
t C l C ttt9 "aacgaagc aacatttaca caagatggac attacattat tagtgcatcc 
tctgatggca ctgtaaagat ctggaatatg aagaccacag aatgttcaaa tacctttaaa 
tccctgggca gcaccgcagg gacagatatt accgtcaaca gtg^gattct acttcctaaa 
aaccctgagc aotttgtggt gtgoaacaga tcaaacacgg Jggtcatcat gaacatacJa 
ccctStcc ST? 0 " 0 a3ttCt ** ta -agagaag^ tggggacttt SttgctgS 
caat^So! ? 93 99atCtact ^ tgtaggggag gactttgtgc tctactggtt 

cagtcagtca ctggcaaact ggagagaact ttgacagtgc acganaagga tqtaattSt 
attgcacatc accctcatca gaacctgatt gctcctacag tglagatgga ctcctaaagc 
tctggaaacc ataattcaac ttttcttttt taaatcaact cgaaagcatg tncttaaata 
TtttlttZ ^^30 tttttttttt tgncactttt cLagcaaat agltggctgf 
attagtcacn gaataaattt gngaaaatca tggttaaatn ccaac 9 



<2X0> 452 
<211> 765 
<212> DNA 
<213> Homo sapiens 

<400> 452 



nnngnnnnnn ntttcctaaa tgcttggggn nnnnnngngn nnnnnttctn atgttcctan 

tSacf^ Ct r C9M19 CCngtgCg * fc ^cggtctga tt£l££ 
"!! ggttta tcat 9 caa at cctcgcctct ggctacggct ggctgaatgc tgcattgct? 
a^tctaS? 9aC " Ctgaa caa ^acta aaggccttcc cagcLaaL ggaattgtnc 
Sctgtttat tltT 9Ct 3tCatCgtaa aa tagttttg gcatcacagt Satacfgaa 
aSat^- aatgat9g3C a 3 tc =ttcggc cattcctgta ccagtatgga gtttgcagcc 
atatgtctca gaaatgcctt gttgctgctc ctgaagaaca gcaagatcca LacLaS 
atggggctaa aaatagtaat caattaggtg ggaacacaga gagcac^ gcagtSaa" 
ttgcagcagt naaagccatg atggagatna attcattcca gc?tcacctt cttctccatt 
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gagaaaacag gaattagaaa acttaaagtg ctccatactt gcttgcagtg cctacgtggc 
tctggctttt gggtgatacc tcatggcttt gaatcatgcn gatnaacttc ttcagcagcc 
caagctgcag gatctcttaa gttttgggac atttatatgc tgcagaaccc ttatcttctt 
cgacngaatn tctgtgccnt tctcacttga ccccgagaat gtnct 



<210> 453 
<211> 833 
<212> DNA 
<213> Homo sapiens 



<400> 453 

gngtnnnnnt ttctaangtt cntaatannn tnggctactc gttctttctc caggnatccc 
ntgcgattcg aattcggcac gagagaaacg ttctcaggtt gaccagctgc tgaatatttc 
tttaagggag gaagaactta gtaagtcatt gcagtgcatg gataacaatc ttctgcaagc 
ccgtgcagcc cttcagacag cttatgtgga agtcaagagg ctacttatgc tcaagcagca 
gatactatgg agatgaatgc actgaggacc catagaatac agattctaca ggggattaca 
agaaacatat gaaccttctt gagcacccca ggttttggca ttagaaaatg ggtacccctt 
ggttcaaaaa tgaacaaaga aagccttaga tttggatggg ggaacctgat ctgtccagtc 
tagaaggatt ccagtgggga aagtgtttcc atttccttng tcccctggct tggccaagga 
aagcgaaagc cctttcttga anagcaaccg tggatcattn gaccaggaac ttccttctgg 
ggtattaagc ttctttcaan tggaagggaa aggttccang gccaaaggaa aaatggaagc 
ccccaaccng atgggtttca ccctaantaa cctcaattgg aagggcttgg accaagaacc 
cnggaaaggc nanccattgc acccttaaaa ncaaggaaag tggaccacct ttggggcttg 
ncnttccntt ccgaaccagg ttgaaaangg gcttgaaaaa tggttgctta cccaaaaggg 
cgnacnttaa tggcaccaat tattcctntg gaccnttttt aatanccttt ngn 



<210> 454 
<211> 737 
<212> DNA 
<213> Homo sapiens 



<400> 454 

gnggnnnntt ctaagttcct aatnctgggc tactngttct ttctgcaggn atcccatcga 
ttcggcaaaa tcaatgtgga ctgaacataa atcacctgat ggaaggactt actactacaa 
cactgaaacc aaacagtcta cctgggagaa accagatgat cttaaaacac ctgctgagca 
actcttatct aaatgcccct ggaaggaatn caaatcagat tctggaagcc ttactattat 
aattctcaaa acaaaagaat ctcgcttggg ccaacctaaa gaacttgagg atcttgaagc 
aatgatcaaa gcttgaagaa agcagtaagc aagaagagtg caccacaaca tcaacagccc 
cagtccctac aacagaaatt ccgaccacaa tgagcaccat ggctgctgcc cgaagcagca 
gctgctgttg ttgcagcagc agcagcggca gcagcagcag cagctgcagc caatgctaat 
gcttccactt ctgcttctaa tactgtcagt ggaactgttc cagttgttcc tgacctgaag 
ttacttccat tggtgctact gntgtagata atgagaatac agtaactatt tcaactgagg 
aacaagcaca acttactagt acccctgcta ttcaggatca aagtgtggaa agtatncagt 
aatctggaga agaaacatnt taaccaggaa actgtanctg attttacttc caaaaaagaa 
gaagaggaga gccacct 



<210> 455 
<211> 7X8 
<212> DNA 
<213> Homo sapiens 
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<400> 455 

ggnnimnntt tnnncngttn cntaaaannc tgggctactc gttctttctn cangnagccc 60 
aSS=r 9 J attCggcac ^gatgagg agtgtttaat cattgataca gaatgtaaaa 120 
caaatSC * ggaaag3Ca 9 ct 3"gtgg gttctaactt aagttccaga ccagctagtc 180 
caaattcttc ctcaggacag gcttctgtag gaaaccagac taatactgct tgtagtcctg 
aagagtcatg tgttttaaaa aaacctatca aacgagtata taaaaaattg aLcagttgg 
agagatttta aaaatgcagg atgagctctt aaagccaatt tccagaaaag taccagaatt 
gcccttaatg aatttagaaa attctaaaca gccttctgtt tctgagcaat tgtctggtcc 
ttcagactcc tctagttggc ccgaaatctg gatggccttc tgcatttcag aagccaaaag 
gacgattgcc atatgaactt caggactatg ttgaagatac atcggaatac ctagctcctc 540 
aggaaggaaa ttttggttat aagttattta gcctgcaaga cctgttgttc tcg?cgctgc 600 
aSKat aga ggatagagnc a ^ccacgtt ctaaaacnga gaaatcagaa gacatttnca 660 
gttatgtctc caaaagtgag tntcagctgt atgagttgac tctgctgaaa gtgacttg 



<210> 456 
<211> 739 
<212> DNA 

«.2I3> Homo sapiens 



<210> 457 
<211> 743 
<212> DNA 

<213> Homo sapiens 



<400> 457 

SS^ST t ^ nnngttt ccaattantc tgggngctcg ttctttctcn anncnnnnan 
tggttgncga attcggcacg aggnnanagg gnagctacat gnntnaccnt nttngnnctc 
^ a ^ a ^ tCnnCtnnnn "ggtcctac tgctacatag aacacttgtt ntncnnggna 
actrmtntat gtnnccnnga ntctctgnna ctngtttaaa tgctanttga taacaggcta 
tgcaaggnct gnaagtggan agcgtcatca ttcatcatnc ntnttanctn gantnnntgt 
a ™ aC ^ g C " tgattgg taaatgngcn tcagactggg actctcaata aatgnatata 
I SZZlnt Ctg ! ggaaan ct 9tcctc t c ntatctntnc atgngnaant tccactncag 
toeS^T aaatgCnntn ^nggnganc cctncttgta tagtggtgtc cattccaanc 
S^ 39 ? 9 ^ tagaaacc 9 t cggctntngg aaacnatggt gnnagttgan ctggtacang 
cngttntcac ctgcanctac cataaaatgg gnttacccaa gctttatcat ggaatggnt! 
taaaaaacgc attnattgng cctttntaan cccattatnt gttnaatttn acttaSgtt 



240 
300 
360 
420 
480 
540 



718 



<400> 456 

gtggnnnntt ctnngtttcc aatangntgg gtctcgttct ttctnnacga tcnnntgcga 
ttcgcttggg aggctgagtc aggagaaatt gcttgagccc aggagatgga ggttgcagtg 
agccaagatc atgccactgc actccagact gggcaacaga ggJagacJcc g^tSaaaaa 
ctaaaaaaaa aaatncattt agtataccgg ggggtggggg ggagaaataa tgttatttcc 
tatgcgaaat gacgtgtatc cctgtaccca tgggtaaatg taaatatact gtgtctcttt 
tgggagagcc ttttagtaga ggagtcttat atgaagtctc tcataagtag ^tcacttgag 
ttttgcagtt tgaaatctta aaggagcttt aattgacatt tattatacca attaagcttg 
gaatggggca atggatgcat ttccaaaacg tgtgaaagcc taacagctta tattgctgaa 
tgagaatctc ctgggtgtaa tttancactt agggaactgc gtgaacactc ccagccatta 
tgatgctggt accagcttta ntgtntaaat gccatganta ttctttctgn tctgttttgt 
Stt^SS ^""f t " acccttta cngaataatt tcttgtaaaa tccntaaaaa 660 
tntttggcat ttaaaagtcc nntcttggan tnaananann nnnaanaaaa ancttncccc 720 
tttanaactt tngngggct /zu 
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ccccccattn aaattatnca attgggnann gangcttcna gtcnccatnt ttnaatggnn 
ttnncaaaaa aacgnttttt ttt 



<210> 458 
<211> 906 
<212> DNA 

<213> Homo sapiens 
<400> 458 



gnngnimrmn nttnctaatg cttgggnncn cgtttctann nnnnnnnnaa ^^ttcctaa 
?tggttaggn gctcgnnctn tctccacnna gnnnngcggt gcgaattcgg cacgaggctg 
aaSaaggat cacaaactnc acatttngca cnttggtctn cacatncntg gttngggcag 
tcncagSaa catggctntg gaaactnatn ttngnctngc ntcaaccatc tcgttcccng 
gggacccaL ntccLnatc ncgnnttncc tcgnnnatng gagngctnct tngnccannn 
aSjggctccc nanaatangn ntnncnnngn nnatncancn ncngncaarm ggntcnncnt 
nnnnnngccc trmtncctna tggnnngctn catgncccat nnnnnnggnn ancaataann 
naaanggtct nttcccncga nncccnnnnn ccnctaacan ngnacctcgc ^gcccc 
aggcnttnnc tngnaaacca nnttngccaa nggtanttca aaggnngcct tngggacctc 
ccnannnngc cntggnnnta ccccgggnaa anggtngnaa acccnnccnn ngntgccnnn 
cccggncr^g gaaanaaatt tccnnggnac ccagnntncc nccgnaannn ananta^nc 
ccancccnaa cnttnngccc ncanccnttn gnnnntgnan tcnncnnncc ^ttnnnntn 
nccaanncgg ccnggnnacn nncttnnacc tntttncncn naanngacnc caanntc^ 
nannaaaggg nggnnnnnnn nnncnttncc nnnggnagcc cnnnnnncct nncntnncnn 
aaaaattcnn cnLgnancn cccctnnnnt nangngnccc natnnniumn rmngnaaanc 



nnaccc 



<210> 459 
<211> 765 
<212> DNA 
<213> Homo sapiens 



<400> 459 

gnngnnnngg nttcctaang ctggggnccg ntctnnnnnn nnnnnnnngt tcctaaanac 
c^SSctc gntctngctc cacgcagncn gggcgtgggc gaattcggca ^cttctg 
ttgSSgtt tgtttaaagt acctaagtac taccctttga ctccctacca aaagttcttt 
t^tttSaa acaactttta tttgtgactt actttcttga gaagngttct ^atgaattg 
cataaaatag tggtagcagc ttatttctta agtncttnat tattggggct "accattca 
ggtcttatct ttaaccctta tttactcagt tttccatctg aatgatccta "tctaaatn 
aaggatttaa taaatgctgc aaattgtcca ctttgcaaat ngtccaaaag ctttjgtttt 
ggaccttgng aacttttttt ttaataacac attatttggg cccggtcgtg ^ggctcaag 
cctgtaatcj cagcactttg gaatgcctag gcagacagat cacttaangc ctggagttcg 
agaccagcc? ggccaatgtl gtgaagacct ccggttctat tactaaaaat nctaaaaaa 
SncaaLca tggnggtgca cgcctgnaat ctcagctact tgagangcaa atcnggagaa 
atgcttlacc nggangcan anatganccn anattgcacc actgcattcc acctggggan 
nanantgaga anctggctca aaacccaaaa acccaaaaaa aaaan 



<210> 460 
<211> 677 
<212> DNA 
<213> Homo sapiens 



720 
743 
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<400> 462 



ctaatggctt ggmumnnng nnnnccgntt cttaattgnc ttgggcnnct cgctctntct 
ccannnagnn nntgcgttng cgaattcggc acgagcctca gcclcacacc agctcSttt 
caggggtgag agtcagagag cactgcaata tgtgcttcat gggatttcga tSaaiatc 
ctagaccagg gagacactgt gagccaggga tacaacaaaa Xctaggtfa gtcactgcag 
TtTcZT* ZIT 9 ^ 9 , tgggtttgtg gagaatcaga 2c2£££ 

tcIgg^S ggccagggat cctgtggaac ctcttctagt 

tttlSf 5 9 ! Ctgcca ^ tt Sr tctagtgaca tctgatgctt gctgtgaact 

tatJStat !" 9a ^ CCtg a ^ a "tcaa tctttaattg ccctgtattc cgaagggtaa 
tttntll t ^atgga. attttaaaga tgaatccccc ttttttcttt tctnctctct 
tttctttcct tctccctttc ttctttgcct tctaaatata ctgaaatgat tta^tatgt 



60 
120 
180 
240 
300 
360 
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<400> 460 

gtttncgctg ggagccacca acatagcaga ttaccatgtg aagttgccac tgctgcatct 

cctgaaacct ggctgatggg agaggtctca ttttgtgtct gagaatgtcc aggtSctq 

Stttata 3 ? ! aCt9 ? tttC CCattagcag tt.tt.tttc ctggccfttt Scctgaag 

Igttatcaac ct'acatta f ttttatcagc -^agggctg tcattcttc? 

ggttatcaac ctctacatta tgaagtaagg ttcaaccctt ctgcttttct caggccccca 

TtTalltT, tat !! aatC9 aa — aa cgggtattga gaaggaattg gc^ggctca 
gtggctgttt ccgttgctcc taoctcatgg agactcttac tcatgctgga tttlSaqa" 

Sccactttc ! t tgaC f aCtC aCCCCCacc c actcttatgc agtctgttca ttcctgaS 
tcaatf a ^ctcctg cacacaaccc atgagggatt gctacttcct ataagattcc 
tcagtgagcc ttatagagtt gctgcgagaa ttacatttgg tcatgatgtc aagtSctaa 

ssr 69 aacacatagt aatttattgg aataatt - sss; ^ 

677 

<210> 461 
<211> 787 
<212> DNA 

<400> 461 

gmmnnnnag ggnnnngngg ggcctcncaa agcccgngcn acaggtcccc gttccaaaac 6n 

cSttcS SIT r a3naa9g C ^ aan * cggScgagg acat^t^ J 

gagcn^atc tllltltl^ ' a = aCa * att caactttaga gaggacnggg gggnrmncng 18 o 

f™ 333 t a39gaag9an tatoacgngg ccncccnnga atataarmnn gaagctgnga 240 

acagnaccat cagnaacann nnatggacag ctctgatggg grmnatacca cggcactcS l 00 

cnnaccrmng gnggaagcna tccggagnna tgactgangn gnaaagnggn n^act^aa fi 

aanccnggng ngctaggann ctgggagagn caottLaig LgnJSS SganS^c III 

atcanaagaa cccgganaag ngagaagacn ggaaaaagnn cncancgnac 480 

nannnncnot gagccanggg ctncgaaang ccccaccnga agcnccatca cfnaScS sin 

ggnnngggaa aaggaancna cnnngcngac angnccnccn aanagngcca JS2E£cS 600 

nngcccncnc gcccaaagaa nacnggacng cnggcncnna ncaiaaaggag cSnanaacc tin 

cnnggnaang aaactncnag nagcccaanc ccaaaggccc cnanggSS 720 

gaaaacanna nncacccaag gggcctgggc naanaaggcn nccclcncng gc^Sc l\l 

787 

<210> 462 
<211> 747 
<212> DNA 
<213> Homo sapiens 
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gtcaccaatt aatgatcttt tattcaatct aagaaatggn ttaagttttt ctctttagct 
c tatggcatt tcac tcaagt gggacagggg aaaaagtaan tgccatnggc tccaaagaat 
tnntttatgt tttagctatt taaaaaa 



<210> 463 
<211> 750 
<212> DNA 
<213> Homo sapiens 



<400> 463 

tncctttcta angcnnntng nnaanngtcn ccgttctaan tncttgggca gnncgctctn 
tctncannca gncnntgcgt tgcgaattcg gcacgaggcg agatgaagct acactgtgag 
gtggaggtga tcagccggca cttgcccgcc ttggggctta agaaccgggg caagggcgtc 
cgagccgtgt tgagcctctg tcagcagact tccaggagtc agccgccggt ccgagccttc 
ctgctcatct ccaccctgaa ggacaagcgc gggacccgct atgagctaag ggagaacatt 
gagcaattct tcaccaaatt tgtagatgag gggaaagcca ctgttcggtt aaaggagcct 
cctgtggata tctgtctaag taaggattcc atatggctct catatcattc cattccatct 
ctgccaagat ttggataccg caaaaatttg tgttngngga agattctgnc tgaactcttt 
cattcaagga actactacca tgaatctgca ttctgntgcc cacactgagg ncttagtaga 
taattgggtg gtctgaaaca cctattatct cttatntctg gtctctangc tggnatgtta 
attcctctga aatgntaaaa gtaatgggtg anaccngaaa aagaaatttc aatnacagat 
caanntgggg ngcatgtatn attttcaagc gtcaaaatgg aataagggaa gantnctgga 
tacctgcttg gaaaaggaag natgtgtatn 



<210> 464 
<211> 748 
<212> DNA 
<213> Homo sapiens 



<400> 464 

gnngtgtctt tgnaaagcct ttggggaann gncnccttct aatgcttggc tatcgntctt 
tacgcagnnc ccatcgattc gaattcggca cgaggccggc cggcgacgct ggcgacgctt 
tcgcccctga ggtagtttgg cgaccgcgaa gaaggaaaaa gggcgggcgg gcggctgtcc 
tctcaccgtc ctcaccccgc gaggcccggc ccgctcctcc gtcgtggatt tcgcggcgat 
ccccccggca gctctttgca aagctgcttg aaacttctcc caaactcggc atggatacga 
ctgcggcggc ggcgctgcct gcttttgtgg cgctcttgct cctctctcct tggcctctcc 
tgggatcggc ccaaggccag ttctccgcag gttggntgct tctttcgttc tctcctctgg 
gggctctgaa gtttcaccag gtggacgctg gggagcgggc tcccgagcac ttgtctacct 
nccgccagtc ctgacaactt ttctggccaa cctacccagc ttcgcttggc tggcgagcgc 
atctgctgct ggggttcgcg gtgcaaatgg agacgcagtg gtggccagag ggtgatggag 
aagacgggaa aagcgacagc cacgctnctg gcttgaagcc gcaggacgca aataacttac 
tttggacctg acagttctac gttgntgtgg angccctgtt tcctggaaat aaaactcaaa 
atggtggttt tttggaaaaa aaaaaaat 



<210> 465 
<211> 863 
<212> DNA 
<213> Homo sapiens 



<400> 465 

gggnnnnnnn aanggnnnnn ggnnnnngtc ccgttccaan gaccnngaga tcgnngncgc 
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tccanaagaa aggcggtgng aattcggcac gagacctgta ccgcctggcc actggctgtc 
accggcgtga tgagctgccg gtgtttgaac gcngcctatg cngggacttt cccggcanan 
nggcnngaan atggccncca tncaggaagc cgcccagaac ctcctnggnn acacnacttn 
agngccttcn agtccgntgg nacccggncc aagccccggc aancnctgcc ccgggtcncc 
gttcccaagg ccaaccagcc ctgggnaccc ccggggagcc gaaacnctgg ggctnggana 
ccgngantga gagncncact tttcnntgta nacacgggcc cagganacan ctntgctcgt 
ggccccgggg naaannnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn ncc 



<210> 466 
<211> 713 
<212> DNA 
<213> Homo sapiens 



<400> 466 

ngtctttcga gcntggngnt cgttctngct cnannanatt ggttgnggga attcggcacg 
agcctcagcc ccacaccagc tctatttcag gggtgagagt cagagagcac tgcaatatgt 
gcntcatggg atttcgattc gaagatccta gaccagggag acactgtgag ccagggatac 
aacaaaatac taggtaagtc actgcagacc gacctccctg cagtttggga aagaagctgg 
gtttgtggag aatcagagca tcttgacatg actgctgacc taaagatccc tggcattggc 
cagggatcct gtggaacctc ttctagttca ggggtgtgag cattagactg ccagttgtct 
agtgacatct gatgcttgct gtgaactttt aagatccccg aatcctgagc acctcaatct 
ttaattgccc tgtattccga agggtaatat aatttatctg gatggaaatt ttaaagatga 
atcccccttt tttcttttct tctctctttt ctttccttct ccctttcttc tttgccttct 
aaatatactg aaatgattta gatatgtgtc aacaattaat gatcttttat caatctaaga 
aaatggttta attttttctc tttactctat ggcanttcac tcaantggac aggggaaaaa 
agtaattgcc atgggcttcc aaaagaattg ntttatgntt tagctatttn aaa 



<210> 467 
<211> 732 
<212> DNA 
<213> Homo sapiens 

<400> 467 



gnnnggtnnt ctaatncttg nnnnnnnntc ncccttctaa gccntggnct cgnctnnccn 
acnancnggc ttncgaattc ggcacgaggc gagatgaact acactgtgag gtggaggtga 
tcagccggca cttgcccgcc ttggggctta ngaaccgggg caagggcgtc cgagccgtgt 
tgagcctctg tcagcagact tccaggagtc agccgccggt ccgagccttc ctgctcatct 
ccaccctgaa ggacaagcgc gggacccgct atgagctaag ggagaacatt gagcaattct 
tcaccaaatt tgtagatgag gggaaagcca ctgttcggtt aaaggagcct cctgtggata 
tctgtctaag taaggattcc atatggctct catatcattc cattccatct ctgccaagat 
ttggataccg caaaaatttg tgtttgtgga agattctgtc tgaactcttt cattcaagga 
actactacca tgaatctgca ttctgntgcc cacactgtgg tcttagtaga taatttgggt 
ggtctgaagc acctattatc tcttatttct ggtctctagg ctggtatgtt aatcctctga 
tatgttaaaa gtaatgggtg agaccngaaa aagaaatttc aatacngatc aantttgggg 
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780 

840 

863 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
713 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



121 



BNSDOCID: <WO 9933982A2J_> 



PCT/US98/27610 
WO 99/33982 



tgcatgttga atttgcaacc tcaaattgga gtaagggaan attctggata cttgctggaa 
aggaggaatg tn 



720 
732 



<400> 470 

tatgnntttn ctaaaatncn tgggcaanac gtcctcnctt tcctaanagn ttnggcanaa 
cccttggcaa nacgccngtn acccanacnc agnnnggccg tggcgggcga gcgggcaaca 
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<210> 468 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<400> 468 

gnnagnnttc taatngcttg tnnnnnnnna gacgttctaa nnctttggcn atcgttnttt 
ctncagnann ccntcgattc gaattcggca cgaggccggc cggcgacgct ggcgacgctt 
tcgcccctga ggtagtttgg cgaccgcgaa gaaggaaaaa gggcgggcgg gcggctgtcc 
tctcaccgtc ctcaccccgc gaggcccggc ccgctcctnc gtcgtggatt tcgcggcgat 
ccccccggca gctctttgca aagctgcttg aaacttctcc caaactcggc atggatacga 
ctgcggcggc ggcgctgcct gcttttgtgg cgctcttgct cctctctcct tggcctctcc 
tgggatcggc ccaaggccag ttctccgcag gttggttgct tctttcgttc tctcctctgg 
gggctctgaa gtttcaccag gtggacgctg gggagcgggc tcccgagcac ttgtctacct 
tccgccagtc ctgacaactt ttctggccaa cctacccagc ttcgcttggc tggcgagcgc 
atctgctgct ggggttcgcg gtgcagatgg agacgcantg gtggccagag ggtgatggag 
aagacgggaa aaagcgacag ccaagctcct ggctgaaacc gcaaggacgc aaaataactt 
actttgnacc tgacagtttc tnacgtttgt tgtggangcc ctgtttcctg ggaaataaac 
tcaaattggt ggtttcttgg aaaaaaaa 

<210> 469 
<211> 776 
<212> DNA 

<213> Homo sapiens 
<400> 469 

gggngntcta atgcttgnnn tgattctccg tctataacng gntaatnctt ggncctacna 
aaaggctang ngaattcggc acgagacctg taccgcctgg ccactggctg tcaccggcgt 
gatgagctgc cggtgtttga acgcaaccta tgctggactc tcccggcaga ctgcctggat 
atggtcgcca tgcaggaagc cgcccagcac ctcctcggca cacacgactt cagcgccttc 
cantccgctg gcagcccggt gccgagcccc gtgcgaacgc tgcgccgggt ctccgtttcc 
ccaggccaag ccagcccctt ggtcaccccc gaggagagca ggaagctgcg gttctggaac 
ctggagtttg agagccagtc tttcctgtat agacaggtac ngaggatgac ngctgtgctg 
gtggccgtgg ggcttnaann tnannnnnnn nnnccnnnac caantctncn nannannnnn 
ccnacnnnta aaantnncnn ncnnnnncan nnnnnnnnnc cnnnnanncc nnnncttnnn 540 
naancnnnnn nnnnnnnanc nnnncancna nnnnnnnnna nnncnnncnn nnnnnnnncn 600 
nnnnnnnnnn nannccnnnn nnnnantnnn nnannnnnnn nnnnnnnnnn aaannnnnnn 660 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncncnnnnn nnnnnnncnn nnnnnnnnnn 720 
ccnnnnnnnn nnnnnccnnn nnnnnntnnn nnnnnnnntn nnnnnnnnnn nnnnnc 776 

<210> 470 

<211> 765 

<212> DNA 

<213> Homo sapiens 
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gctcttgagg agtgagactg cnggagatnt gggccgtgcc aaagagatgg atgagactgg 180 

tgctgagttc atcaagagga ccatcttgaa aatccccatg aatgaactga caacaatcct 240 

gaaggcctgg gattttttgt ctgaaaatca actgcagact gtaaatttcc gacagagaaa 300 

ggaatctgta gttcagcact tgatccatct gtgtgaggaa aagcgtgcaa gtatcagtga 360 

tgctgccctg ttagacatca tttatatgca atttcatcag caccagaaag tttgggatgt 420 

ttntcagatg agtaaaggac caggtgaaga tgttgacctt tttgatatga aacantttaa 480 

aaattcgttc aagaaaattc ttcanagagc attaaaaaat gtgacagtca gcttcagaga 54 0 

aactgangag aatgcannct ggattccaat tgcccgggga acacagtaca caaagcccaa 600 

ccagtcaaac ctacctacgn gggggactac tccagactcc cgnacncctt cacgtcctcc 660 

tccatgctga ggcgcaatca ccgcttctgg gncaagaagt tanaaacnct gggaaaaact 720 

acctncgaca agaaggggan catttanatt tacccnaaat gaana 765 

<210> 471 
<211> 820 
<212> DNA 
<213> Homo sapiens 

<4UU> 471 

cnnnnngggn nnngnggcgn cntccnaaan ccggggcgac agngccnnng ttccaacaga 60 

ccnggngngc cgncngngcc ccanacagca ngggnggggc nnnggggnnn cnncgncnnn 120 

cnnancnaca aagaactcaa caagaaaaaa acnaacccca caagcgggca aaggacgnga 180 

acagacantn cccaaaagaa gacatacaag caaccnaaaa taatcnaaaa taagnnncaa 24 0 

aaagaaaaaa ngcnagacag agnngngana gnactnagna aaaagngana tctagcggcn 300 

annagnangn nngnnnacgg ncngnnncna agaaanagnc nctggnnccc aagcnggagn 360 

acagcggcgc aagcnnggcn cactgcaacc gcgaacnccc gggctcaagc gaaccnccag 420 

cctcagcctc ccaagnagcn gnnaaaggca ngcaccacca cacccgacna aaatancngc 480 

nancaanaac ananaanggc nccccngngc nnanncagga aanaaacacn cnnangcnnc 54 0 

ngaaaanaan naancncncn cnnnacaaaa aaacnnnagc cnnagaacaa nnnnggaggc 600 

ggaanacggn nnancccgac anganaanga nacnanngan gganganngg gaccaaaccn 660 

cancccggga anggcnnngn aaaaaaaang ccnnnaaann gggggaaaan ncggngnang 720 

ccnaaagggc cnnaaanggg gaaacccnan naaaangccg ggcanannan aaccnagcnn 780 

nancnanccn nccaangggg nannnccncn nncnaggccg 82 0 

<210> 472 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 472 

gnngtgtctc taatgcttgg ctactngttc tttccgccga acncttgcta atgcttggcn 60 

ntcgttcttt ctccacnnac nnngcnntnc gaattcggca cgaggtcaca ganatnaaag 120 

tccaatcata ggggctggnc cnacntctnt gctnntccct gcangantca tangatcagn 180 

nanaccgtgc gnntttgnaa gcntttcaaa tgtgntacca tcgngttact tncnnnggca 240 

cctgntgann tnggttgnac tnnncnggat nctccaaanc caccnnnncn atgggntnng 300 

tgngcatgng ntggnncann nacagannna ganactttaa ngaannngnt tntgcaaccn 360 

tnggnnctag caancntgan antnccaggg nnggccacna agctgaaaat nnatgttana 420 

ncnnatgntg naatctctag natgacttcc ncannnancn aaactnangc anggctgcna 480 

tgttagaanc tanaggccna atttcttntc natgnaacca ntntatgctt ttaagaccnt 540 

caactgtnnc natgaagccc atntacatna ttncggtaat anggctatnc ttaaannnaa 600 

ctgctgaaaa tnatgatnca nctacgaaat cctnncancc ncatntggct naatcattac 660 

caaccatttg acaccnncat ngnctaccca cntgcattnc catgaccnan tccantgcca 720 
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738 



cccgcncaga tntacctt 

<210> 473 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 473 

tatgnntncc taanagagtt ntgnnacacg gcccgccttc tnaaancttc ctaatncttg 60 

ggcgctcgtt ctntctncac ncagnnnntg cggtncgaat tcggcacgag gtccttttga 120 

accaccccaa agaactcaac atggcaaagc aaatggtaaa agcttcccga ctgttctact 180 

ttgggtccgc gcgaagccca ctcacgtgtg atctgtgttg cccctgggag gcccggggcg 240 

accggaaaag ggctctctca agttctgaaa agagaatctg ccaccagatc gaatttcgac 300 

ccctgagctt gttcggacgt atggtccaaa ttcagattaa ggtggtcacc caacccgaga 360 

tgtcaggaaa ggccttctgc agagaaaatg tccccccacc cgccatctgc agccaggtgt 420 

gtgccacacg gcagccttcc cgaaacatag tatggatttt aaaaatgtgt ntatttttgg 480 

ttctcaacca ctttataacg tattttttaa tttattttgt aatgtcttgt tttgaagtat 540 

tgctgctatc cttggtatcc ttcccactgg ttttatcact ganttatttt gngaaagttg 600 

ncactaatgt tctatgtcaa aatcaaaagt atttaatgaa atactanntc tatttaatgt 660 

ggntatggaa ccagctggaa acacaaaaca aacagtgatt gacancaagc tgggcccaag 720 

agncaggtca ttttgnacat atgccaataa ac 752 

<210> 474 

<211> 752 

<212> DNA 

<213> Homo sapiens 



<210> 475 
<211> 742 
<212> DNA 
<213> Homo sapiens 



<400> 474 

ttgcanacnn aatanttgct gtaaaagtcc cnnctttttn ccctttctaa tgnttgngcg 60 

ctcgntctnt ctccacnagn nnntgcgttn cgaattcggn tctnagccca tgccgggatc 120 

ttcccacacc cgtcctcaca gatccagccc cagcccctgt cttcccaggc catctctcag 180 

cagcacctgc aggatgcggg cacccgggag tggagccctc agaacgcatc catgtcggag 240 

tctctctcca tcccagcttc cctgaacgac gcggctttgg ctcagatgaa cagtgaggtg 300 

cagctcctga ctgaaaaggc cctgatggag cttgggggtg ggaagccgct tccgcacccc 360 

cgggcgtggt tcgtctcctt ggatggcagg tccaacgctc acgttagaca ttcatacatt 420 

gatctccaaa gagctggaag gaacggaagt aatgatgcca gtttggactc tggcgtagat 480 

atgaatgaac caaaatcanc ccggaaggga angggagatg ctttgtctct gcagcagaac 540 

tacccgnccg tccaagagca ccancagaaa gancctcanc cccagacagc acggnctaca 600 

cgcanctcgt gnacctggat gacntggaac anaatggtan cnaatgtggg accacngnct 660 

tgtancccna ggacaaggcc ctncnangct tgntggangg gtcnantcng anaaatggng 720 

gccactgccc aacccgcang aaganaacaa nn 752 



<400> 475 

gntttctntt aatncttttn naaangcgnn ntttaccntt ctangnntgn gnctcgttct 60 

ttcccacnna nnnnncggtn cgaattcggc ncgaggtgaa acagaaagtg gagatgcttt 120 

ccttgacctg aagaagcctc ctgcctccaa atgcccccat cgctatacaa aagaagaact 180 
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cttggatata aaagaactcc cccattccaa acagagcctt catgcctttc tgaaaaatat 240 

gacagtgatg gtgtctggga ccctgagaag tggcatgcct ctctctaccc agcttcaggg 300 

cggagctcac cagtggaaag tctgaagaaa gagttggata cagaccggcc ttccctggtg 360 

cgcaggatag tagatccacg agagcgtgtg aaagaagatg acttanatgt tgttctcagc 420 

cctcagagac ngagctttgg agggggctgc cacgtgacag ccgctgtcag ctcccggcgc 48 0 

tcangaagtc cattagagaa agatagtgat gggcttcgtc tgcttggtgg acgtaggatt 54 0 

ggcagtggga ggataatctc tgcccggacc tttgagaagg atcaccgctt aacgataagg 600 

acctgcggga cttgagagac agagaccnan anaaggactt caaggacaac gtttcangan 660 

anaanttttg gagaaagtaa ncntgtcttt tggtgancgt anaaanaaat gattcttacn 720 

cnnaanaaga acccgaatgg tt 742 

<210> 476 
<211> 1122 
<212> DNA 

<213> Homo sapiens 
<400> 476 

gnnngggnnn cbctaaaagc cgggnnnnnn nnngaggnnc ttctaatnct tctaatggtt 60 

ggctctcgtt ctttctncac gcagcnnngc gnnncgaatt cggcacgagc ctgcagccac 120 

taatgcattg tgtatgataa caaaaactct ggtatgacac attttctgng atcattgnta 180 

attagtgaca tagtaacatc tgtagcagct ggttagtaaa cctcatgtgg gggtggggtg 240 

ggggtgtatn cctngnggga nggnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3 00 

nnnnnnnnnn annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 420 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 54 o 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 900 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 102 0 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 
nnnnnnnnnn annnnnnnnn nnnnnnnnnn nnnnnnnnnn nc 



<210> 477 
<211> 747 
<212> DNA 

<213> Homo sapiens 



1122 



<400> 477 

gnngtgcctt tgaaannccn tttgnnnnng nggcccttct aatgctnttn cgntcgnggg 60 

gtcgaactcg ccccacncng cnaggcgggg gctncaagcg attctaaacc acctatgagt 120 

atttctttta gggctcactt aaatacatgt ntgngnntac tgggggctag ccngaataat 180 

tttagatctg atcaggtngn ngctnaaatt ngaaaaanac cnnntngatg cttaaagaat 240 

tngcntccat ttttgagtct aaatctttta aaatntactg ngatccacat ctagngaaat 300 

gtcngtgtca anatattctn gatnatcgct naaatccnca ttaatactcn ttnggggtnn 360 

nnnatagngg aacttcntag nnntncnaaa agcacatngn cttcctgnct ccgctgctcc 420 

cacagnnggt nttgnaactg ggnaaatcag nnnnnngata gcgngngnnt ntnaganaaa 480 
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ntngatncac acatncttrm nnctcagncn ncacatngat tgaacactct ggccaagatg 540 
ctgnggngga tgangttgga gttcgannga agaagccngc gctggcctgg cttgnaagac 600 
ccnngncttt cccntnccct cnctngaaag ctgcccngac ngaggccnaa ngnaaatggn 
tganngnncn gtcnngcccn cttcngncnc ttngaaccnn nnagnggnnc tnnnngnacc 
cnnngnnntn cgngnaaccg nnccngc 



<210> 478 

<211> 746 

<212> DNA 

<213> Homo sapiens 



660 
720 
747 



<400> 478 

gnnnnnngcg cgnccttcta atgcttcnta attnnctngg atactcgttc tttctncagg 60 

natcnnntgc gnttcgcaag gagnagagtg atagnaattg gcagtgaaat atacgaacca 120 

ccctcctgcc ctctgggttc acaatacgtg tacacttgac tgtgaagtgg ctgtgagagt 180 

gggtggagag ttcttctttg accctcagcc tgcggatgcc tctagaaacc tcgtgttgat 240 

tgcaggagga gtcggaatta accctctgct ttccatcctg cggcacgcag cagatctcct 300 

cagagagcag gcaaacaaaa gaaatggata tgagatagga acaataaaac tattctacag 360 

tgcaaaaaat accagcgaac tcctgtttaa gaaaaatatc cttgatttag taaatgaatt 420 

tcctgagaag attgcatgca gtttgcatgt tacaaaacag actacacaaa tcaatgcgga 480 

actcaagcca tacatcacgg aaggaagaat aacggagaag gagataagag atcatatttc 540 

aaaagagact ttgttctata tttgtggccc acctccaatg acagactttt tctccaagca 600 

actggaaaac aaccatgtac ccaaagaaca catttgcttt gagaagtggt ggtaggaggc 660 

aagaccaaag gcaggaaaaa attaangagg tgagatctac tcaaggagag ctcaaaaaaa 720 
aaaaaaaaaa actngggccc tttaga 



746 



<210> 479 
<211> 750 
<212> DNA 
<213> Homo sapiens 

<400> 479 

gnnnnnnnnn nngngnnnnn ttctannntt cntattnnct nggagctcgt tctttctnca 60 

ggatcccntg cgattcgaat tcggcacgag ggtagactgg ctagggatcc tggacccagg 120 

gttccacgta gcaacacctg ctgagttctc tgggttttct tcctgcctca tgtagcccag 180 

acttggagct gaagaagctg gaaacatgga aacaccaaca gctacagacc aaaaaaagtc 240 

ccaacaaagg cctgtcagtc tgccagcctg ttctgtggat ttccaactca agattgcagc 300 

atcaactcac acctgaagtt ctggcttccc tacaaacttt gaacttgcca gtccccacaa 360 

tggcataagc caattcctta aaatgaatgt ctagttctag ataatgtgtg tattctactg 420 

gttctgtttc tctggagaag cctactaata gatcatttgt cttagtcaat tcaagctact 4 80 

ggtacagatt accatagact gggtggttaa aactaccaat cttattactc acagtttttg 540 

gagtctggaa agtctgagat cagggttcca gcaggattga gttctttggt gaacatnctc 600 

tttctggnct acagaatact gggttacttt aagtnggaaa aagtagggtg aagctgggtc 660 

ntttggcctc ttcttttaag ggggactaat tcatgaaggg ttccaccctt attgacctat 720 
tttaccttnc caaanggntt ccattttccn 

<210> 480 
<211> 714 
<212> DNA 
<213> Homo sapiens 
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<400> 480 

gnnnnnngnn nngnggnnnt tcnaccgttc ttntgaccta gnctcgntnt nnccnacnna 60 

gctaggcttg ngaattcggc acgagataac acacatcaca gtatgctctc agaaatttct 120 

ttatttgaac cctataccaa tatctgntga tcaatgacca tttttgctca gcatggagaa 180 

acagtgccct gcatgaaggg tagtgagaat aaaaaggatc ttaccacctt tatcatgagg 24 0 

gtggctttgc tctctccatt ccaagttgtt ctctgttcta gaaagcagat gtagtagaca 300 

tctactgttt ttgcctaaac agaatccctt tttccttttt ttgttaaaag tactcatccc 360 

taatattaca ttgttctgga aggactgaaa ataacagaac tcagcaccat gatcggaccg 420 

ggacaatcag attatttcat tcctcancaa acggagatcg atccgaaaag tggaaatatg 480 

agctcttctt tggtgttggc atatggaccc tgagagaaag aactttaatt ttttctcttg 54 0 

gactgcaata aagtatagct gcctaaaata cgtttcctga ccttggangt ttgnccacaa 600 

tcggtgaaat aaangcaaga cgtacacttg gatgaaaaaa aaannnnnnn naaaaaaaac 660 

tcgaccttta nactatnnga gtcgatacnt aatcngactg atagatcatt gnta 714 



<210> 481 
<211> 742 
<212> DNA 

<2I3> Homo sapiens 
<400> 481 

agccnttcta aangccnttt gnctnngnnc cccnttccta anncntggct aatncttggc 60 

nactcgttct ttctncacgc acccatcgnn ncgaattcgg cacgaggcat gaaaggagtc 120 

ggaagcggaa gcggtagccc ggacggtgct gtggtgcaag ggcttgtgga aaaattggag 180 

aaaaccaagt ccctggccca gcagttgaca agggaggcca ctcaagcgga aattgaagca 240 

gataggtctt atcagcacag tctccgcctc ctggattcag tgtctcggct tcagggagtc 300 

agtgatcagt cctttcaggt ggaagaagca aagaggatca aacaaaaagc ggattcactc 360 

tcaagcctgg taaccaggca tatggatgag ttcaagcgta cacagaagaa tctgggaaac 420 

tggaaagaag aagcacagca gctcttacag aatggaaaaa gtgggagaga gaaatcagat 480 

cagctgcttt cccgtgccaa tcttgctaaa agcagancac aagaagcact gagtatgggc 540 

aatgccactt tttatgaagt tgagagcatc cttaaaaacc tcagagagtt tgacctgcag 600 

gtggacaaca gaaaacagaa ctgaagaacc atgaagagac tctnctacat caccagaagg 660 

ttcagancca atgacaagac ccancaagca naagagccct ggggagccct ctgctgatcc 720 

caaanggcaa aaaatggggc cn 742 

<210> 482 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 482 

gnnngnggag nnttctaatg ctttgntcta gagtcncnnt tctaanggct tggnaatnct 60 

ngctcttgtt ctttntgcag gatcccatcg attcgaattc ggcacgaggc caagcctcgg 12 0 

cctccactgc acctgctgcg gagtggcacc tttgcctgca aggccctcta ccccatggcc 180 

cagtgtcatc tcagcagggt ctttggccac tcaggaggcc cttgtggtgg gttgctcagt 240 

ctgtccttcc ctcatgagaa gctactgctt atgtccacag accaggagga gctgtcacgc 300 

tggtaccaca gtctgacttg ggctatcagc agccagaaaa actagaggaa tcttatagat 360 

tccagaactc aggatacctc agggataggt cacagccaag agtacaaagg aatcttcagt 420 

actgaacaaa acagaaccct tcatgatttg acaaaggtca ctttctgttt gcctggacca 480 

agctactcca gatcatctga ccaactctta aaaatcacgg ccaggcacag tggctcatgc 540 

ctgtaatccc agcactttgg gaagcaaang tggcaggatc attccagccc aggagtttca 600 

agancagcct ggcaacacag tgagtgagac cctgtctcta tttaagaaaa aaattattaa 660 
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gaaattttat taaaaaagga agaatcagga aaccaaagtc aaccccaact taaccctcaa 
tgaaccagcc ctaacacaga tgangggatt tg 



720 
752 



<210> 483 
<211> 849 
<212> DNA 
<213> Homo sapiens 



<400> 483 

gnnnnnattn ccctttnaaa tnccncngaa ancccttgga agcactaccn ctcngacccc 
tttggaacgn cgactnctnn atatatcnng gatataatag gtgataagtt ctgncaatta 
gtaacatcng gaaaaaacag ctnngncctg ggngaaaaag gatgccaaaa tngcctggaa 
aagagcagng gagaggagtc cgggagatgn gngatgcatc gggacgcanc atngntnaac 
attcactggg tctgccaaaa atgtggattt gngggctgct tagatngtta caaggcaaaa 
ggaaaggaaa gagttctaga gataaaagaa ctatatgctt ggatgaagtg tgtgaaggga 
cagcctcatg atcaccaaca tttaatgccc aacccaaaat tataccnggt tctgntttga 
cagacttcta gatgccatgc acactcttag ggaaaaaata ttgggattaa ancccatngg 
cattggacta acaaacagga atttacaagg tnggaaantt ttncnaccaa tgaaaggggg 
gatcncaagg ttttccagaa nggntcntaa tcncaggnaa taaaaattnc tctngggcaa 
gccctgagtc ttaancagca aaaanactcc tcccgaancc tgnagaaaaa agggggggca 
gccaggcccn naaanggaan gtnaggcccn agatnaacaa ngtnacctcc ncccagnaaa 
ccccannccc caactggnac cngggnaacc cacaacnttt gcngaagncc aaaaaagncc 
nnnagangga aaaaaaaaaa naananaaaa aacctnnnag cccctaagaa accttagggg 840 



nggcccncc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



<210> 484 
<211> 1098 
<212> DNA 
<213> Homo sapiens 

<400> 484 

gnnnnnnnnt ttnnnnnttt ttgnaaaanc ccccttttgc naaatngncc ctttttntgg 
cangggatcc ccatntttat ntcggacatt ttcgggccac cggaaggggc cgggggcccc 
cgggccncca ggnccgggna aaggcccccc ttgggcggcc cccggncggc cccaatgggt 
tccaaaaagg gaaaaaaaaa aaagggggaa cctgggaagt tggcccanga aaangnaaaa 
aaaggnaagn aaaccttccg ccaatgggaa tggggaaaaa taattttttc ttgaaaaacc 
caaaaaagga atggttattt ttcaaattta aaaaaggaac nttgggaaga aagaattggc 
ttcccacncg cagaaagggc attactggct atgtcaagta aaagaagtcc ttcaaagctt 
agttgatgat ggtatggttg actgtgagag gatcggaact tctaattatt attgggcttt 
tccaagtaaa gctcttcatg caagggaaac ataagttgga ggttctggaa tctcaagttg 
tctgagggaa gtcaaaagca tgcaagccta cagaaaagca tttgagaaag ctaaaattgg 
ccgatgttga aacggaagag cgaaccaagg ctntgcaaaa agagcttttc tttcactttc 
gagaccaaag gggaaccagc tnnaagggcn agaaaagttn gaaaaaaatt ccaaaggaac 
tggtggaatc ccccaaaagg tttggttggg gaaagaaaaa ttcccgcccc aangccaaaa 
tttaaaaggt ttngccccca aagggaaaag ncttgncctt taacccagga attggggacc 
ctgggantta aaaccnataa ttttcccgcc naattnnaaa aaattcnttt nggggncccc 
naaaanggna aaaaaatttt nggggggttt tggnaaggna aaaatttnaa atttggattt 
ngaaactttt ttngggaatt ccccagaaag aacttttgac cttccnttng acctnaaaaa 1020 
ttttcccttg ggggggtgna anggatgttc ccaagctttg tggnatattg gtaaaatttt 1080 
naaccttttn tncttacc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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<210> 485 
<211> 798 
<212> DNA 
<213> Homo sapiens 



<400> 485 



<210> 486 
<211> 785 
<212> DNA 
<213> Homo sapiens 



<210> 487 
<211> 797 
<212> DNA 
<213> Homo sapiens 



<400> 487 

ttgtnnnncc cttttnaaat ncctttggct anttgntctn tttgctngat cccatcgatt 
cgaattcggc acgagnnngg actaccttnc aaaaccnggt ngggaagcnt gttacagaan 
tgatntctan tcccctgnat tctggatgct gcagaccaac acctgccnac aanacncana 



60 



gnnnnnnant nnnntttnaa atccttnntg aatcctttga antaccatcc cnttttncga 
attnggcacg aggaaaggtg gcgcgcttct cacggctgag ttgctgcgcc ttgcagacgg 120 
aagctcccca caggcagagc tgcttggatg tgtgagtcat gaaccagaga agccccgctc 180 
catgagcagt gactccccan gccctgtgac ctccctcctn cttgcagctc ctcctggcac 240 
cagtccccag ggctctcctg ttggtagttc ctgcttttct tcttggaaat tcctcgtgga 300 
cctcgagatc tttaccctaa aatagttctg ttgaatttca ccctggcaat gtaaattgat 360 
agcttatctt cacagatgcc agacaatgga caactcacca tcagtcctct gctcacctga 420 
gacaaatgca tgtctgattg cttcctctgc cctattgntt atgtgaaaat gcagattcac 
tgagccagac taaggcatca gtgactgttc ctctacctgc ctctcacatg gagattgtgt 
attcagtgaa aggctgatca aagacccaaa ggaatgcaac agtttatctc ttatctacct 
atgacctgcg aactggccaa caacccagtt gttgncgcct tttcagacag aaccagtgtc 
OWLUttViaC9 ^uunaaatg gatgtcctgg ngtctnccta atatgtattc aaaagcaagc 720 
tggggcctng accacccttn ggcacatatt cctcanggac atcattcctg angctatatc 
actggcatgt ccttaanc M a 



480 
540 
600 
660 



780 
798 



<400> 486 

gnnnnnnttt gaaanccctt tcnaatnctt ggcattgntc tctttgcagg atccctcgat 
tcgctgacaa cttgattggg ttctccttca ggtttgaagc gccctcgaga agtgtctaaa 
ggagacagtt gatagccaaa caacagtttt ggattcactg actgattatg aaagaagcag 
tagactggta tcaagaatca gtcagcaagg aggccctcac cagacgccag tgccatgttc 
ttggacttct cagcctccat attcatgaac taagtttttg gaatccttag gcttccacgt 
gtggaaagcc tgagctaacc tactggagga tgagccatca cctggagcag attcaggcca 
tcctagttga agcctcccta ggccaagcaa ccgtccaact accagacatt gaccattcag 
ccttgaacat tcagcacaaa gacaaaacag accagaccag aagagtccca cagaataggg 
gaaactattc agagaaaact taagccacta agttttatgg tgttttgttc tgtagcagaa 54 0 

gcataggcat actgacaata caaaccgaaa tccttctaac gtagtggacc ttttcangcc 
agcatttttt ccttgaaaac ctggagcatg tatccatctt atagcagaga tcactttcac 
aatggttggg ctcttggatt tgaattgatg atgtaatgag ccctctttnc ngattgnaac 
ttaattactc tgggnatttg ntggattccc aaccttctaa tatttacttt tcctctttan 
taanc 



60 
120 
180 
240 
300 
360 
420 
480 



600 
660 
720 
780 
785 



60 
120 
180 
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cacacacann caancantat catgtaagac agnncgntna ntnnnnnatt ntnatncttn 240 

nncatttacn cantnttgta nantggntca tgngtctata natnnttgta antattntnt 300 

gananangac ganantctga atcttaagca tatgctccat cnttnnatat gctntggtgg 360 

agaggctngc cntnattcat nttnncatgg agncaagttt aatgcctcta gantacattc 420 

tgggcttcaa gcatncttat ttnnaactcc ctgagtgatg ggtggataaa tcnaacattg 4 80 

nctnagtggn ntcaagacaa ctttgntggt ggttttgntc acaatcatga aaatggttnn 540 

gccagataaa tattttgata ttagntttcn tttttnatat anngcggtag gtttgaattg 600 

nacnttnaaa tgnntngggt tgtnaagaca ntggnttnca atnnaattta tnacatgaat 660 

tggngnctcc cctttggnga aaccttaaag aanttntgna tacttcttca taaaagggtg 720 

tgngatttng naantttcgg gggttttnaa tttttnntga agcttatttc ntganaatnt 780 
acttggntta ccaagcc 



797 



<210> 488 
<211> 762 
<212> DNA 
<213> Homo sapiens 

<400> 488 

caaatcnntt gctctngttc tttttgcagg atcccatcga ttcgcgacag ctctccaata 60 

ctcaggttaa tgctgaaaaa tcatccaaga cagttattgc aagagtttaa tttttgaaaa 120 

ctggctactg ctctgtgttt acagacgtgt gcagttgtag gcatgtagct acaggacatt 180 

tntannggcc caggatcgtt ttttcccagg gcaagcagaa gagaaaatgt tgtatatgtc 240 

ttttacccgg cacattcccc ttgcctaaat acaagggctg gagtctgcac gggacctatt 300 

agagtatttt ccacaatgat gatgatttca gcagggatga cgtcatcatc acattcaggg 360 

ctattttttc cccacaaacc caagggcagg ggccactctt agctaaatcc ctccccgtga 420 

ctgcaataga accctctggg gagctcagga aggggtgtgc tgagttctat aatataagct 480 

gccatatatt ttgtagacaa gtatggctcc tccgtatctc octet tccct aggagaggag 540 

tgtgaagcaa ggagcttaga taagacaccc cctcaaaccc attccctctt caggagacct 600 

acccttcaca ggcacangtc ccccaaatga gaagtctgnt acccctcatt tcttnatctt 660 

tttacttaaa ctcaagaggc agtgacaggn agtcaggggc aagacattac atttttcata 720 

ctttcccaca tctgaaaaga tgacagggga aactgeaaag cc 762 

<210> 489 
<211> 822 
<212> DNA 
<213> Homo sapiens 

<400> 489 

ttnnnnnnct nnnggnnttt enaatnettg tttctcgncc tttctgeagg atcccatcga 60 

ttcgaattcg gcacgaggat tttcgaaact cttcagctac ttgccctttt ttatctgaaa 120 

ccatcatacc ttctgaaaga aaaaagcata tcttcattga cataacagaa gtgagatggc 180 

ccagtcttga tacagatggt accatcntnt atatggagag tggcattgtg aagataacat 240 

ctttagatgg teatgeatae ctctgcctgc ccagatctca gcatgaattt acagtacatt 300 

ttttgtgtaa agttagccag aagtcagact catctgeagt gttgtcagaa acaaataata 360 

aagccccaaa agataaacta gttgaaaaaa ctggcaaaat ctgtatacgt ggaaatttac 420 

cangacagag actgaagaat aaagaaaatg agtttcattg ccagatcatg aaatccaaag 480 

aaactttaaa gaagatgagt tgtgtaaatg gaactgaagg gagggaagag ctgccttcgc 540 

ctggtacaaa gcacacatgt gtatacacat gggtcaagca gtgctggtct gtggctgcct 600 

gtccagagga atgggaaata ttcctttgtc tttagcactt catttttcta aataaaaatc 660 

anccaatatg tctaaaaaaa aantttnttn ataataaacc tngaagcect nttanaacct 720 

tntnntggag gtcctnnttt accntatgat tcccggaact tggataagga atcccntttg 780 



130 



BNSDOCtD: <WO_^9933982A2J_> 



WO 99/33982 



PCT/US98/27610 



gattgganat tttgggccna aaacccncna nncttggaat cc 



822 



<210> 490 
<211> 789 
<212> DNA 
<213> Homo sapiens 

<400> 490 

ntgtaancct tttcaaatcc cttggctact tgntctttct gcaggatccc atcgattcga 

attcggcacg aggccggacn gtgactctgg rmacgcttgc gnccntnacg tagntngnng 120 

accntgcang anggaanaan ggctggccnn cngntgtacn ctnaccgtcc taaccccgcg 180 

aggtccaggn ccgctccttt cggngnggat tctcgcggaa natccctccg gcagctcttt 240 

gcaaagctgn ttagaaactt ctcccaaact cggcntggat acgactgcta tagggctcgc 300 

tgctgctttt gtggagctct tgctcctcta tccttggcct ctcctgggat acggcccaag 360 

gccaagtntt cacgcangtt ggtacgctta tttcgttctg gactctgggg gctntgaann 420 

ttcaccacgt ggactgctgg ggancgggnt nccgancact ngnntacctt acnccanaat 480 

ctgacaactt ttctggacaa cctacccanc ttcaattggc tngnqaqcnc ntcnantari- 5 ah 

ggggnntncn gtgcaaatgg agncncaatt ggtgggcaaa tngttgatgg ncaaaacggg 600 
aaaaagcaac nnncaangct tttggctnaa agccgatang acncaaatta nttnctttgg 
accttganaa tttcctcaan nnttttnagn anncnctttt ttncttggan aaanacttaa 
aagtgaacga ttnttgggaa anaaacaaac tataataact naaagctttt ntaaaaaaaa 
annaatnnt 



<210> 492 
<211> 804 
<212> DNA 
<213> Homo sapiens 



<400> 492 

tcnaaatccc ttttgnnagn ttcncncttt gtttcccttt nctnggctnc ttgttctttt 



60 



660 
720 
780 
789 



<210> 491 
<211> 790 
<212> DNA 
<213> Homo sapiens 

<400> 491 

tccaaaatnc ccttggantn attccccctt ncaatacctt tccttngnac actcccngtt 
tngntngatc ccatcgattc gaattcggca cgaggnaaca aagaaggaat gtcttcctca 
tgtttnggtc tatagaagac gttaaagaaa acttccagaa agtgggtttg aggcatgagc 
caccacgcct ggccaaagga tttaatgaat taatggatgt acagtgctgg ggctgttatt 
ctagggcctg cattgagact cacattttgc catcaaaagc cttttaagag gtggaggttg 300 
cggtgagctg acatggtgcc actgcactcc ggcctgagtg acagagtgag actctgtctc 360 
acaaaaaaaa taatgccctt taaataatga ataatagtga tagaaaatgt catttcttgg 420 
acaaatgaaa aattgaaatt aatgtatata attagatatt attagctact cttaggtagc 480 
ttcatttgtt gaaagtttga caagtgaatg aagttcacat ctggaaatcg ttgaacattt 
ttcgttcatg gaactcaatg gctacgttag tcgtttatgc ttttcactgt tgtggtaggg 



60 
120 
180 
240 



540 



gctttggaaa gtnaatgcca tcaacaatgg atacagaang acctggattt ggaataaggg 660 
caaaaattta ttttgatggg gctgaattgc tctgccaggg agcattttgg gtattgagat 720 
gaaaatggcc tctctttgag actgagctgc cacctggcaa attattgnct gcttaanoqt 
tctctttatn ya 



780 
790 



60 
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120 

180 



tgcaggaatc ccatcgattc gaattcggca cgaggtcctt ttgaaccacc ccaaagaact 
caacatggca aagcaaatgg taaaagcttc ccgactgttc tactttgggt ccgcgcgaag 

cccactcacg tgtgatctgt gttgcccctg ggaggcccgg ggcgaccgga aaagggctct 240 

ctcaagttct gaaaagagaa tctgccacca gatcgaattt cgacccctga gcttgttcgg 300 

acgtatggtc caaattcaga ttaaggtggt cacccaaccc gagatgtcag gaaaggcctt 360 

ctgcagagaa aatgtccccc cacccgccat ctgcagccag gtgtgtgcca cacggcagcc 420 

ttcccgaaac atagtatgga ttttaaaaat gtgtttattt ttgtttctca accactttat 480 

aacgtatttt ttaatttatt ttgtaatgtc ttgttttgaa gtattgctgc tatccttgnt 54 0 

atccttccca ctgtttttat cactgattta ttttgtgaaa agttgtacac taatgttcta cnn 
tgtcaaaatc aaaaagtatt taatgaaata ctagttctat ttaatgtggg ntatggaacc 
ancttggaaa cacaaaacaa acaggggatt gtacaagcan gcttggggcc caagnaaggt 

caaggttcat ttggttacca tatgccnata aaacctcanc gaanttttaa aaaaaaaann 780 



600 
660 
720 



nnnnnnaaaa aancttggng ggct 804 

<210> 493 
<211> 800 
<212> DNA 

<213> Homo sapiens 
<400> 493 

ggnncnnttt nccccccttt tgaaaacccc ttttgggnga ancccncttc tttnaaatcn 60 

cttggctact cgctctttnt gcaggatccc atcgattcga attcggcacg agtatataac 120 

aacttttgct ttcaaagttg ggtgggacta gaacacacaa tggaaggatg gagtcaggag 180 

acctggattc ttgtgcccgc tctggctttt acagtctgcc taactctatg cagtcacttc 240 

ctgccagcct gtttccttac ctacaagagg gagagacact ccctggccag cctagttctc 3 00 

agggtgaacg aaaggtcatt atcactgcat cctctagtca tttgcttctt cgctaattaa 360 

cacatcttga gcacctgcga tgttccagga acaggagatg gcagcgtgca agataaaagt 420 

ccctgacttc tagagactgc atgttagtgg caatcggcgt ctacccggcc ttcaataaac 480 

tactgaatga aggaaaattc tacctagcac cagacacaat tactgggttt ctaaaatgga 540 

attattcccc cggccccctg catccagcag cctgctgcag ggaagctcct ccgaagctgt 600 

aggcaggagc gggacaaatg cttgctatca gcttcacaga atgttaccta agtactattc 660 

ctacacagcg ccttacagaa caaacagtaa aaaccaaatg gnaagcatgc acnggcttaa 720 

aaactcaaac ttcctaacta ctcagtaatt anganggtca ttttacccca aaatagaatt 780 

ttcnatttat ccaataanaa 800 

<210> 494 
<211> 757 
<212> DNA 
<213> Homo sapiens 

<400> 494 

nggnttcnnt ctaactnaaa cngttnggna actcncctct ntctgtngat cccatcgatt 60 

cgctaacaag cgattctaaa ccacctatga gtatttcttt tagggctcac ttaaatacat 120 

gtttgtatat actgtattct agccagaata attttagatc tgatcaggta gtagctaaaa 180 

ttagaaaaaa acaaaataga tgcttaaaga atttgcatcc atttttgagt ctaaatcttt 24 0 

taaaatatac tgagatccac atctagtgaa atgtcagtgt caaaatatta tagattatag 300 

ctaaaatcca gattaatact catttggggt tttttatagt ggaacttcat agtaatacaa 360 

aaagcagatt gtcttcctgt ctccgctgct cccacagtag gtattgaaac tggtaaaatc 420 

agttttttga tagtgtgtgt atataagaaa aaatagatac acacattctt ttttctcagt 480 

caacacattg attgaacact ctggcaaaga tgctgtggtg gatgangttg gagttcgaaa 540 

agaagaagca agcgctggcc tgccttgaaa gaacccgaaa gtctttccca ttcacttctc 600 
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tagaaagctg ccaagacaga ngcagaaagg aaatggatga tagttctgtc aagcacactt 660 
ctgntctcnt agaacttaga aatggttcta agagaacaga agttatngag aacagttcnt 720 
gtggaattca acatcttggg tgggacncat tggcttt 757 

<210> 495 
<211> 756 
<212> DNA 
<213> Homo sapiens 

<400> 495 

ggnnnnnntc ttttcnaatg cttggctctc gttctttntg caggatccct cgattcgcaa 60 

gagagagtga tagaattggc agtgaaatat acgaaccacc ctcctgccct ctgggttcac 120 

aatacgtgta cacttgactg tgaagtggct gtgagagtgg gtggagagtt cttctttgac 180 

cctcagcctg cggatgcctc tagaaacctc gtgttgattg caggaggagt cggaattaac 240 

cctctgcttt ccatcctgcg gcacgcagca gatctcctca gagagcaggc aaacaaaaga 300 

aatggatatg agataggaac aataaaacta ttctacagtg caaaaaatac cagcgaactc 360 

ctgtttaaga aaaatatcct tgatttagta aatgaatttc ctgagaagat tgcatgcagt 420 

ttgeatgtta caaaacagac tacacaaatc aatgcggaac tcaagccata catnacggaa 480 

ggaagaataa cggagaagga gataagagat catatttcaa aagagacttt gttctatatt 540 

tgtggccacc ttcaatgaca gactttttct ccaagcaact ggaaaacaac catgtcccaa 600 

agaacacatt tgctttgaga agtggtggta ggaggcagac aaaggcagaa aaaattaaga 660 

ggtgagatct actcaggaga gctcaaaann aaaaaaaaaa aaactnggac ctntagaact 720 

atagtgagtc gtnttccgta gatccagaca tgataa 756 

<210> 496 
<211> 744 
<212> DNA 
<213> Homo sapiens 



<400> 496 

ctttnaatcc cttgcactcg tcttntgnag gaccttatcg attcgaattc ggcacgagat 60 

aacacacatc acagtatgct ctcagaaatt tctttatttg aaccctatac caatatctgt 120 

tgatcaatga ccatttttgc tcagcatgga gaaacagtgc cctgcatgaa gggtagtgag 180 

aataaaaagg atcttaccac ctttatcatg agggtggctt tgctctctcc attccaagtt 240 

gttctctgtt ctagaaagca gatgtagtag acatctactg tttttgccta aacagaatcc 300 

ctttttcctt tttttgttaa aagtactcat ccctaatatt acattgttct ggaaggactg 360 

aaaataacag aactcagcac catgatcgga ccgggacaat cagattattt cattcctcag 420 

caaacggaga tcgatccgaa aagtggaaat atgagctctt ctttggtgtt ggcatatgga 480 

ccctgagaga aagaacttta attttttctc ttggactgca ataaagtata gctgcctaaa 54 o 

ataccgtttc ctgacacttg gaggtttgcc acaatcggtg aaataaaggc aagacgtaac 600 

actggatgaa aaaaaaaaan nnnnnnaaaa aaactcgagc ctntagaact atgtgatcga 660 

ttcgtagatc cagaatgata gatcattgtg agtttggaca accacactng atgcagtgaa 720 
aaaatcttat tgngaattgn gatn 



<210> 497 
<211> 772 
<212> DNA 
<213> Homo sapiens 

<400> 497 

gnttgnngtn taantttnta aggatccctt tntntgaanc cctttctgca ggatcccatc 
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gattcgaatt 
gagcacagat 
aaaaaaaaca 
atggtcaact 
tcttcacctt 
gttaagtata 
tgtttccttc 
gaatagtttt 
caataccttc 
gaaaatttct 
taaccaaaaa 
ttgcaaacca 



cggcacgagg 
catgccactg 
tagaatttgg 
gcttcagctt 
gtttccaagg 
catttatttt 
attgtttctg 
tttaattagg 
caaagtaagg 
ggaatattca 
ctgcacatta 
ataaatactt 



caggagnaat 
cactccagcc 
atcctttggt 
tgttttggca 
ctgaggaaca 
ttactgtgac 
gaaaggaaag 
gcttattttg 
taatattcag 
cattcgaaga 
ttctttgnac 
aagctattta 



cacttgaacc 
tgggcaacaa 
cgggttctcc 
accccctgcc 
gaaagtagcc 
ttgttcagga 
ttctattaat 
aaaaattctg 
agacagttgt 
tccttattat 
attttcatta 
aaaaaaaaaa 



ctggaggttn 
aacgagactt 
caaattcttt 
cgaagtcgca 
tctgttttga 
ccacatttta 
attgntttac 
agtttaattc 
tggtgatcag 
gaatgtcttt 
tatagngtta 
aaaaaaactc 



cggttgcagt 
cgtctcaaaa 
tgaggtgtcc 
tataggctgt 
ggaggtggaa 
caaaatgcct 
tttgaatata 
aaatgtatgc 
atggcttaga 
gacttaaatc 
acaagcttan 
nc 



<210> 498 
<211> 773 
<212> DNA 
<213> Homo sapiens 



<400> 
nttnagcnta 
catcgattcg 
tcatcctcct 
gggtatcagt 
gctaccaaag 
gtaactgcca 
cactgcctat 
ggtattcttc 
tnactgtctc 
cttctgcaat 
ctcantttcc 
aaattacact 
gtatgtctag 



498 
nnagccgttg 
aattcggcac 
gtgagctctc 
ctggccactt 
taatgttnta 
atctcactta 
gcagcaacct 
ctgatgcccc 
tctgccagtg 
atcatcttct 
agtccctgaa 
taaaatggtc 
agaacaagca 



tantgaagcc 
gaggacccag 
tgtaagtctc 
ggctttctgg 
aaaccattat 
gaataaaatc 
catctttacc 
tagtacacaa 
agattcctct 
aaaaagagcc 
tggccattcc 
accttcatga 
gnttggctca 



cntttgctac 
gtagaccagc 
tntcttgccc 
tttgccccaa 
accaagttac 
cggactcctg 
gtttctgcct 
caactccttc 
tctggtatta 
tttnaaaatc 
ccactttcat 
tgggaaggca 
tagtaggcac 



ttgctctttt 
tcaagagttc 
atcaccacat 
tgtggtctat 
tatccttgtc 
tgaagcacag 
tgctcactcc 
ctgctccaag 
cctntgcttc 
accttttcta 
agccacttaa 
attaattgcc 
tcaacaaaaa 



tgcaggatcc 
atgttctttg 
ccctagtact 
tcttgatgca 
aaaaccccca 
nataaactgg 
cttcagcgcc 
agtaggaaaa 
attgctgaat 
ttatgcccta 
ttgctatctg 
tttgtcactg 
ttt 



<210> 499 
<211> 735 
<212> DNA 
<213> Homo sapiens 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
772 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
773 



<400> 
gcttcaatan 
agagtaccca 
aatggatnaa 
canccagtga 
aatatngacc 
catgctgtgc 
atgcattttg 
aaagaacagc 
.acgcttcaag 
naaattncac 
atcttacant 
ttttatcctc 



499 
ctttttctaa 
nanttgcnag 
naaaggcccc 
cangaatgag 
ttgtgaactt 
agactgttga 
ttcacgtctg 
ctanagnaaa 
gactctctgt 
cagctctnat 
ttnccaanna 
atnaatgttt 



ngctcttttt 
gagtntnntn 
ttagaatggc 
tttnaaggga 
ccttgaagat 
aactntgaat 
gagactgcaa 
tggctngaga 
gtggctgggg 
gagantattn 
naagncatga 
aaatgtaaaa 



gcaggattcc 
actgatntag 
aagatnncat 
tgggttttaa 
ggcancatgt 
gaaggggacc 
agcatacagc 
ngaaccacat 
ncctgactat 
tgtcgcgtgt 
aatgtgacat 
nnnnananaa 



atcgattcga 
ccaggtggca 
ttacnnagag 
ctacagaccc 
ctgagaccgg 
atagagtgag 
ccacaggatc 
tcccatcact 
ngacccacca 
tcaggatctt 
tctgcttgaa 
aanactcgag 



attcggcacg 
atnatgagtg 
gtccnagtgn 
agnctctgcc 
aattatggga 
ggataagctg 
tggaagagag 
gaacagggan 
tatggtcana 
antgaaggac 
naagacnata 
ctntnaaatn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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tngtgagttn anang 



<210> 500 
<211> 926 
<212> DNA 
<213> Homo sapiens 



<400> 501 



naatncttgg ctcttgttct ttntgcagga tcccatcgat tcgaattcgg cacgagaatg 
caaagggctg cagttctcat tcaggctact ttcaggatgc acagaacaX tatLcattt 
cagacttgga aacatgcttc aattctaatt cagcaacatt atcgaacata tagagctgca 
aaattgcaaa gagaaaatta tatcagacaa tggcattctg ctgtggttat tcaggctlcJ 
tataaaggaa tgaaagcaag acaactttta agggaaaaac acaalgcttc tatgtatta 
caaggcacct acagaatgta taggcagtat tgtttctacc aaaagcttca gtgggcSca 
TelTtttll aagaaaaata tagagcaaat aaaaagaaac agaaagtatt IclllTctTt 
attcaa^af 3 aa9a3aCtt3 ^eaggc. ggttttcagg acatgaacat aaaaaaacag 
attcaggaac agcaccaggc tgccattatt attcagaagc attgtaaagc ctttaaaata 
aggaagcatt atctccacat tagagcacag tagtttctat tcaLgaaga tacag^ac 
taactgcagt gcgtcccaag cagttatttg tatcagtctt attacagagc tttaagtcca 
aagatatcaa atatgcacgg gctgcacact aatcagtctt ctatca ccaa ^ tcca 

<210> 502 
<211> 784 
<212> DNA 
<213> Homo sapiens 

<400> 502 

ttnntttttt tggttaccct ttgctctngg nctttttgca ggatccctcg attcgaattc 



735 



<400> 500 

"! aa f CC ! t ttCtactnct cttttgcagg attccatcgn ttcgaattcg gcacgaggat 
ctctatacta gtgaacagtg ccagttccac actttggact tagaactgtt ctctagSt 
tgtaacacag aatactgtca atccctaatt tacttaatgt tacttattgg aagtggggct 
gatgaaatac gcacaggagg gaaatctaot gtgtttaggc acaggcagnc ccagtgSta 
aggagatcat attccaaang gttgtcagtt ggntgtttgc aacctggaat gtaSSct 
ttagagacca ngttatccat ggtggttagg cccctagagc agctggaaaa agatgatcaa 

llntntlllt S t9aCat Cnaataatgt a -aagtttg ctaalggaat cLcStcat 
atntnatatt gnttccaggg aaggttgtnn nttaanntnc cntcttngtg ncatantgga 
cnntcccntn ccagtcatnt ncntnannnc tngggcnngt ntngnnttn? tatnttSS 
r7"' a " Ca aca ^cata tcncccctng ctaaaattct ttnanannaa nttetcantt 
tctcccttta ctanaanttt ngtntttnnt contttanta tttnnnccta tntntntcgt 
tcnnanatnt cattnnntnn ttntnngctn ntnnatcacc cttanctcnn tctoa^tat 
SSE£ tlZtT n t att - tCnCt tntnatnatc nttccnnntt gtnt^ 
"" t "' C "? "nntntnct cncatctctn tcntnttctc ngctnannnn actccnnnnn 

cn^at™ ! nC atatnctnct ttngntatat annnnnntnt ntacntanct 900 

cnnnatnnca tnncnatatn nttngt 

926 

<210> 501 
<211> 706 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
706 



60 
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qqcacgagcc ttccacggtt atttcacaga tatggagagc tggaagcagg gagtgagtct 
Sgagtgttg gaattgtaag ggatcagaag cagggatcag aagcagtggt gaagttcatc 
caccataaaa cacacaggtg actttgcctt gaatctgcag gactgaagcc aactcttggg 
cacagaccct tagtcccttc cttggccact ctaagtcaga tagtccagag <~aggccctt 
tgggatgtga caccgagata aatcagagaa aagctgtgaa gcttggggaa cagagggact 
Stggtgaag taggtggtct gcagtttcta tcttcttggg aaaagcaagc tggaaaagtg 
aacStlgt? ggtaggccat agtgctccca gctgggtgac ataatgacca cacagcacag 
SatgtStt agcaactgtg tggtggagta gttgtgggct ggacaaatca atcgtgtgga 
aattgttagg agttttatta cattaaactt gttaacctaa aataccatca aaaaaaaaaa 
"ncSanS nccncccacc nancntncna aaaaaancct cganccttta -aacnnntn 
gnngaggccn tatttacgtt anattccaga cnttgaatan ggatnccatt tgnattgaaa 
ntttngggcc aaacccccaa ccttngaatt gccattngaa aaaaaaatgc cttttatttt 



gnnt 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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784 



<210> 503 
<211> 764 
<212> DNA 
<213> Homo sapiens 

ttrmtnt^rttgaancctt tttctacann cncctttgca gatcccncgt tcgaattcgg 60 

caSagagac aaagaaaagg tggcaatcat agaagagttt ntagtaggtt atgaaacctc 120 

tctaaaaaac tgccggttat ttaaccccaa tgatgatgga aaggaggaac caccaaccac 180 

attactttgg gtccStact acttggcaca acattatgac aaaattggtc agccatctat 240 

tgctttggal Scataaata ctgcSttga aagtacacct acattaatag aactctttct 300 

cltgaaalc? aaaatctata agcatgctgg aaatattaaa gaagctgcaa ggtggatgga 360 

tgaggcccag gccttggaca cagcagacag atttatcaac tccaaatgtg caaaatacat 420 

gctaaaagcc Lcctjtta aagaagctga agaaatgtgc tcaaagttta caagggaagg 480 

aacatcalcg gtagagaatt tgaatgaaat gcagtgcatg tggttccaaa cagaatgtgc 540 

ccaggctL? aaagcaatga attaaatttg gtgaagcact taagaaatgt -ttgagatt 600 

gagagacttt tataggaaat cactgatgac ccagtttgac tttcatacat ^tgtatgan 660 

Igaanattac ccttagnatc ttatggtggg actttattta aaaacttnca nnaatgttcn 720 
ttcgacagcc ttccatttta acttcnaagg cnncaangaa ttnt 

<210> 504 
<211> 795 
<212> DNA 
<213> Homo sapiens 



ttgtacntct ttttnnaaac ccntngctac ttgttctctt tgcanggatc cctcgattcg 60 

gglatctcct agaaagttgt gattttcgag ccatatcctt ctgtggtaga tcctaatgat 120 

cctcagatgt tggccttcaa ccccaggaaa aagaactatg atcgagtaat g^aagcactg 180 

gatagcatL cttctatcag agaaatgaca caagcaccat atctggaaat <=aagaagcaa 240 

atggataaac aggaccccct tgctcatccc ttactgcaat gggttatatc aagtaataga 300 

tcacatattg tgaaactgcc agttaacagg caattgaagt ttatgcatac tccacatcag 360 

ttccttcttc tcagcagtcc accagccaaa gaatccaatt ttagagctgc taaaaaactc 42 

tttggaagca cctttgcatt tcatggctca cacattgaaa actggcactc ctcctganga 480 

Itgg?ctgg t ngttgSttct aatacacgat tgcagctnca tggngcaatg tatggaagtg 54 

gaSctaSt: tagtccaatg tcaagcntat cattttgntt actcagggat ^angaa 60 

Lagaaaggt ntcagcccag gacgagccac cttoaagcng ttaanaagcc agcaattaca 66C 
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ttcacagtcn ccaggaaana aaaggncagn cctatccccc ctttncctgq caaaaaaccc 

22222 :™ 9 ° cttta9occt »™» '»o° 

795 

<210> 505 
<211> 774 
<212> DNA 
<213> Homo sapiens 



<400> 505 

tnntntnntt nantngaacc ctttnctett gctctttttg caggatccct cqattcaaat 
tcggcacgag cacaaggaga agaagttaat taacattgaa ngSgagaaa acJtct^ 
agaacttgaa ttgggcottg gaagaagaac agccattcaa atJgKagaa ttgtggtagc 
tat^L^ ! ggtaOTaaa ^atagatct ccagggacag tagtcatfgg gt^ggggcac 
tgttggaatt taaggttgga aggatatatt ggagcccctt gaatacggta acaSSca 

=2=52 35SS S=5S 222E 22222 2= 

gss 22222 22222 ™ 9 2522: : = 

22222 22222 22222 22222 22™ t9t ™ 
22222 22222 ™" — S22 

ctantattcc anaataaatt tcaacggcct gcntngnctn ctttacaact ttnt 



<210> 506 
<211> 796 
<212> DNA 
<213> Homo sapiens 

<400> 506 



?"Scatc^ " ta9a !f tc ^^tgtaco ctttttgcan nancncgnnc tncttgcagg 
ntoccatcga ttcgaattcg ccacgangtt atattaaatt attctttgtt tttctttttc 
ttttaataaa gcctgcaagt tactaaattg tagtttcata aattctgLg taaJgtatc^ 
t^^TT 9tgCCaaag 3 tgaaaatgat gctttctcta acagagaaat tcttagtgac 

Sctitt: ss::s t c jzesz 

cggcctattg gatgaatcat ttgccgtagg ctaaatcaga ctgtagggtt tgtaataaat 
ttatggagta tgtgggtata gaaatcatga atctagcatt tgtttSga gltSalgca 
tagtcttaag ggtanatcag aaatgacaaa tgaattcaaa acctagcagg ^gcattgSa 
atgtgtgccc agttntgttt tggaaatggc agttccttgg ggtcSgtS ctact^Sa 
aatttgcaat antgtnctat tgtntgtaat ttcaaaatS ftaagattat cccccg^tcg 
cccaagtaaa acctgtnctg cccaatanaa tcctggantc gnngagaaat cg^tccat? 
ognngntcaa ctcgggatnc ntcgncttaa naaaatnttn £2g£i£ cc^cat^ 

n^ngg n"™ *™ —™ 



<210> 507 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 507 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
774 



cto»t«n« t t „ g ,a„c=t t»g=t«tgt "tttttgcg ? »«"»™ '"S 

SEX ™~ SSSS ™S .Uj-j 

cac« r a 9 t ? „tt a tg= aa. = 

«a-^» IT^Z «cl" g ,tt! W t= tgg3 ta g a 
-cttcctS SS" g a gg SSa? g a= accttlta.c ct ggg a, g ca atataact^ 
££££ aatc,^ gg « g ca g „ t g aaa g a g aa g= c, 

S3£™ ™«a 

: g "ra 9 a a? tgg .?«a »cca g aaa=c ™« -«t «« 

SS£ -™ 3mm 

<210> 508 
<211> 724 
<212> DNA 
<213> Homo sapiens 

^cllll'ZL^ g e C a=t» g « ct«« g ca ? .a,™ .^cjaa^c SO 

= == iSi iiS Si s==S s 

■■■■■■ i 

aantactgnc tgggctggaa nangaactnn taaaagt^t agn tcnntnt 660 

cccnannttn nncchntant cnnngggntn angtggtnnn ™ c ™f&* j^...^ 720 
ggnntgncna tngnncnnat gnanacttgg ggttcannaa ncntttccnn atgnaancng ^ 

ngtc 

<210> 509 
<211> 803 
<212> DNA 
<213> Homo sapiens 

tnnnnnt^rtttcnttcgt tetngntttt attacatcag c««ttctt tttgcggtcc 60 
Zr tooca attcagagac acacataaga aactggaaga agagaaaggc aaaaaggaaa 
aaaaaagaca ggaaattlag aaagaacgga gagaaagaga gagggagcgt ^aaagggaac 180 
Sgaaaggcg agaacgggaa cgagaaaggg aaagagaacg tgaacgagaa aaggagaaag 240 
lacgggagcg ggaacglgaa cgggataggg accgtgaccg gacaaaagaa ^gaccgaga 300 

s£s5 = = 33=S sss £= tf 0 

£== = == ==S SiS 555 5 

atacatacga accgaaaaaa aaaaaaaaaa aactcgagcc tnttaactac a 9 c 9 a 9^ 
atSgtaga tccagacatg ataagataca ttgntgagtt tggacaaccc ccacttgaat 660 
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ttSa^tt Tcltllltt: aatt: 933 " tattgetttt tttgtaacca 720 

gttcaagggg aagtt^tgg — ttg cnttcatttt nttntttcan 780 

803 

<210> 510 
<211> 789 
<212> DNA 
<213> Homo sapiens 

<400> 510 



a a ™ J£~ t a ™ 9a 

SSSS a 9 ^a a a 9a S ST" 9 "*" 3ag ~ S= 
cca?tg|cct tgtcatcgca Sea? 3 * 93 * atctaccc * t ^ccctccct cccacagtta 
ctgaaa^gct gSt^tg K£2£ f*"^ 
ctgccccanc t gggg tgacg ££££ ^22^ taggagctgc 

c: g ™i zsss i? g ? c :t:: c t t 9ggca ; cca 

ttc tgt ag t ? .ILZIZ 5ES£ ™ gt fc a c £~ T"" 

tgggcaaaaa catgggggta gttctgagtt gttagaaS tt l" T 33 ^^99 
gaaaacacgt tgtgccttta taco^t-L* 9CCagaaat 9 tttctgaatc aagtttgttt 

gctttgggtt ttttttggtt tgatttaaS EST °" taanacccaa ^tgataa 
ggtttcoat tggtttggtt ttttgcttca ttttacccat tcatgcotag 



<210> 511 
<211> 776 
<212> DNA 
<213> Homo sapiens 

<400> 511 



agatXgtgg ttgcttc^ .tggoaSS ££T "^aagt 

ssrs =S s ™- ssss 

SEES =£= ?"« <~ SS= 

=b s£ £== ass =2= S5ss 

tataagang? gcctgctt" ccSSS """"" g=t=tgg=ca 
>wJ. gctgJt^gc tttcStcc. °"° at9att 9«t:c ctgagcctco 

tetttttnt L£4£ ESTE 222°" « 

3 ttt„,aaaa ^ £ ™ ST"* 



<210> 512 
<211> 917 
<212> DNA 
<213> Homo sapiens 

<400> 512 



60 
120 
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240 
300 
360 
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480 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
917 



«. aa ^^tatt ctttttgcag gatccatcga ttcgaattcg gcacgagggc 
ttatttcata aactattgtt ccutttycay a => M «- 1- 1- «- ecrac ccccaactcc 

«- „ a ^fi- tcaactttaa ctcctgatat gcagcgacag aattttcggc cccca^tu 

=S £-5 =52 5S= == 5= 

15 S ==£ =S 2=55 =2=5 5== 

cStccccc Sgggtccc agcagcaatt cggctactcc ccaaggcagg annanaanca 

Ss ™ = 52= S= == 

" = 2=2 =aa = = 

~^tnaa ^tttttt ^ggaaaaSt naaancntng gaaaancntt tnnntngttn 
tttnnttnaa nnantttcc aaaacntttt naannnntnn nggttnnnan 

"^S: nSt^ccc ccaS^nt nnanngnanc ttttnnantt ngggantaaa 
^^a nggggnnttt tttnngnnna atttnnnnnn annnnnnnan nnangggnnt 

ttngnnngna aimntnn 

<210> 513 
<211> 780 
<212> DNA 
<213> Homo sapiens 

^SSSS— =S=3 i 

=S Sis ==== =s=s Hii « 

n L= t c Cg » t «™c a= . 9 «™= ™ s jjjj^ 4jo 

gacccaggca ccaattgttc ttgcaaactc * 3 ...^^t- aa tataaaga 480 

=5= = 5=SS 3= £ £ 55= = 
=25= =K =522 ==== === = 

ss= = === sss S= *«« - '•• 

<210> 514 
<211> 793 
<212> DNA 

<213> Homo sapiens 

^t^rttaLxcatt ttgctactng ttotttttge aggatcccat £ 

SEES S=S =325 22222 2= - - 
2=222 22222 22222 22252 22222 SS£« ;s 

22222 gaacaatttg taaatgtctt aatttgatgt -»t«c^ *aaacaagag 360 
aLaggtttt taacttanag tagccctaaa atatggatgt ^tataa ^tag^ ^ 
ttggaactgt atctgagtaa cagaggacag ctgtttttta accctccccc g g 
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ggcatgaata taaaacat^t ttatttcaat aaS atCCCtga ^ 9»«ttttga 

tttgggggta caacttcatt a S tt^ CCtgtgtaaa Sttactatgg 

tatggttggg gtgqaccaat llt^f ttaagtggga agtgggtgaa ttctactttt 

caaSaaSa laa ^atcaag agtgacaaat naaggttaan ggatgattcc 



<210> 515 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 515 



iii 5E5 55= = ssss 2= 

SKS ~ « % «ta t a 9 3g , taaaa c. ssks: 

a g ac tgg aca tLtltfaS ta't™^* t !* t9 » aa « »9«agcagt cgactccg, 
atggaSaco aaSaaa*," a3Ca "— »tatatct gg 

c tg Jaa gtg SaM £2a,aal £[™ 

aaaaaacctt taaactataa tc^m-,-^.-^ _ ccnccaaaaa agcttccaaa 

acag tgagtcgtnt acntagatcc aacatgataa 

<210> 516 
<211> 825 
<212> DNA 
<213> Homo sapiens 

<400> 516 

tttccagttt tanttttttc ancttttnga tonntttgca ggatccntct ttt«,« fr ^ 
ggcacgagat tctccctaaa ttgtnoatcc -,-h*,::. sgacccntct tttcgaattc 
gcntgctnan tgctntgtag nncct? tct g IJEoSS tn " gtgCtg 

tgtgctttnt ntnaaanggc anaccagngn tgn^tgSt ttaataatS gagaaCllnga 
tttatccaca cctcrqcccot tt-n^J^ <-gnnccgnnc ttaatgatgc agancctnac 
acantttctg aaaScacnt tt^S ? agtaangnac ^tatttggc tgatggctga 
ttancatctc atcSgcatt c^tccS 9MatMM » gnctgccagg 
tcttgatggg gtctScaa S^£f? t ^nanaaag gganatntca gaattatatt 
tcncggtft? g^ctaaaaan atSS ^tcgaaann tcttaganaa anctatgtgn 
ttaaacg^ga gaSac^at 1 fl ^ anat * a <~cc tggaattncc tgananangc 
ntgttttg?c tggcccSt CCCtnCnaag 9ttntt g gna angcccnJIt 

atattng^gg ngltacttJc SSSS ~ attnaagc c ^gggngag ctttgnatnt 
ncccnaLS gacntttgct tttC^" ^ c ^gaac ancttxmata atgcttntcn 
gaaaaagtc? ? tttaM £ £~ ™ - ^— cacctccant 



540 
600 
660 
720 
780 
793 
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<210> 517 
<211> 1444 
<212> DNA 
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<213> Homo sapiens 

ctctcn^c^cmmtc tctnncnntn nnnnntnntn nnnctcnnnn cnnnatctnn 
«tnn nnnnncntnn cntccntctc ttntntngct ctcntntctc ntncatcttn 
SScf ttnt cntnntnntc nntcntcnnn antnctnnnt tctncctnnc canctntcca 
tnntntactn tcnntnntct ggctnttnta tntggggggt ctatttnttn ncttaaatcg 
a^ngttcca agtctcntan cngcntctnt ctnnctntct ntgcnctncn ctggggcntt 
aatScccnn gctnttatan aagngngnaa ttaaggtntc nnntctarmg ctntgcaagg 
ctaatgSS gatccngnta gaanncgnta catgttggga acngacanct tnctgcncaa 
aonggStna ggcanngrmn tntgcaaann ctcnnntntc nnancttgnn tcncgtagan 
Sg^Scccc tgaattLnn ancnngganc nttaaatnnt ntngnggtac |ann=-cnn 
ncgnnnnnnc gnntannccn canngttaan tgcncccnna nnnantcaac tctntnntcc 
tSSnaacn Lnttantot annatnntta cnnntnagnt tttcctcnct nacnnctctg 
t^t^ttnn atcttntnct tctcncttna tttaf tctc ntntntntnc tnccctnatc 
Stctnctac nctctnttcc ncttctccct nncntctctc atcatatccc acgcnactna 
ncccctct™ ctcttacctn nntnctctcn tcntatctcn nnaccctctt tctntntctt 
at^cnccS tcctctactt attctcctcc tattntncca ctcacccttc ntntntctnc 
ncS^cttn tnctatttnt actntcncta ttcctncntc tctnntgnct cccaccccct 
ctt^tctcn ctctcctnnn nnnactactc tcaccntctc nnctntcnct ctacnnntnn 
ana^tcctt antttcctnc tcatcacant actcttccct ctcatnntca nanctaantt 
nt^ctcac Staccactc tntnctccac tcatatnana cttctatant nctaatccta 
tcJtcttaaa cntctcotct tatcnctcta anctcctott cntcgctanc tccnntncaa 
ctcgnaaatc tctccaatnc tnccccactc taaaaatnnc ncntcngant «cacttttc 
ngngcanaat nnaacncnan t ccnc t ccct ttagctatct ctctanaaac -cntttctc 
aacaggnacc nccctntntc tcnaaatcct catnctncta ctttatatnt cn " aa J^ 
cncctntgta anagcatctc nctntccncc aatnnanatc tccctnctcc natanatntn 



anat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1444 



<210> 518 
<211> 706 
<212> DNA 
<213> Homo sapiens 

ctaatg ^tg > g^gctcgtt ctttccgcaa cancccngcg antcgaattc ggcacgaggt 60 

ccgaagaaaa agactgtggt ggcggagatg ctctctccaa tggcatcaag aaacacagaa 120 

caaatSgcc ttctcctatg ttttccagaa atgacttcag tatctggago atcctcagaa 180 

aatgtatS aatggaacta tccaagatca ogatgccagt tatatttaat gagcctctga 240 

act?cctac! gcgStaact gaatacatgg agcatactta cctcatccac aaggccagtt 300 

tcctgtggaa agga.gcagt gtgtagctgc ^gctgt^ tet£*»g 3 

cttctcagtg ggaacggact ggaaaacctt tcaacccact gctgggagag acttatgaat « 

tagtgcgaga SaccSgga tttagactca tctccgaaca ggtcagccat cacccac 480 

tcag?gca?t tcatgctgaa ggattaaaca atgacttcat ctttcatggc tctatctato 540 

ccaaactgaa attc^ggggg aagagtgtag aagcagaacc caaaggaacc atcaccttgg 600 

agctccttga acacaatgag gcatatacat ggacaaatcc cacctgctgt gtgcataata 660 
tcattgtggg taaactgtgg atcgaacagt atggcaatgt ggaaat 

<210> 519 
<211> 734 
<212> DNA 
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<213> Homo sapiens 
<400> 519 



tngtaccaat tatctgctgg ctanntagoc taaanagntt ggtcngggcg aattcacrcac 
5"* gELIS 

agaaatgnga aagatgcacc aaaaar^^ II? 9 C * a ttctt S a SSa 



60 
120 
180 
240 
300 
360 
420 
480 

ag.a.tgnga aa 3 .t g cac= ^ c l^ ££5Sc «cSt™« " C " 3 «^» 

sr— ~< :» 

734 



<211> 701 
<212> DNA 

<213> Homo sapiens 
<400> 520 



§5 =5 25™ 55| S£S = s 

Sgctg^ S^S"! -tggagcaga cattatccaa ggggagagaf agagac^g 
agcgccgagc tccaggttta agaatgaata tgtggccgac cctgtatacc acacti-n-t-h 
gaagagctct ttccagaaga agtgccagaa gagacagtag tctgcataca 

:zt:i: 9c : u%»z*i sssss 

9S«t«ggtff acagggtgtg tctctgacag tggtaaatc^ SttccaSa 

SSSS ££££ 3 ~ = « ~S 

«: ssss sss =— *~ 



<210> 521 
<211> 784 
<212> DNA 
<213> Homo sapiens 

<400> 521 



ES ~* ~* = s= 

: ~ = =5= =5= £= 

cagtgttctt actacaccac ggaaggctgg ggagcccagg ccctgatggc ccccgtaccc 

S 5 F= =5 = 5S=£ 2 I 
= 5=SS s~ S2S= ~~ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
701 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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aagaaggagg aatctgaagc cttgggtaag gatttggggc acagtaccag gaggggggct 
tggtgccaga cctcatgagg aagaaggatt ttcctatgta cagagaaggg 9 ac ccctgtc 
cSttoggan gtgctgtgca aacctaacca aagttactaa cccctctggt ttctgnggtt 
acacaaangg ggataaatac aaagctttnc ctnaactagc caattctatt tgggtttcct 
gagt 



<210> 522 
<211> 719 
<212> DNA 
<213> Homo sapiens 

<400> 522 



ttctaatttn aatccttnaa atnggttctt tntgcaggat cccatcgatt cgaattcggc 
acgagagaac acaggtgtcg tgaaaactac ccctaaaagc caaaatggga aaggaaaaga 
ctcatatcaa cattgtcgtc attggacacg tagattcggg caagtccacc actactggcc 
atctgatcta taaatgcggt ggcatcgaca aaagaaccat tgaaaaattt gagaaggagg 
ctgctgagat gggaaagggc tccttcaagt atgcctgggt cttggataaa ctgaaagctg 
agcgtgaacg tggtatcacc attgatatct ccttgtggaa atttgagacc agcaagtact 
atgtgactat cattgatgcc ccaggacaca gagactttat caaaaacatg attacaggga 
catctcaggc tgactgtgct gtcctgattg ttgctgctgg tgttggtgaa tttgaagctg 
gtatctccaa gaatgggcag accogagagc atgcccttct ggcttacaca ctgggtgtga 
aacaactaat tgtcggtgtt aacaaaatgg attccactga gccaccotac agccagaaga 
gatatgagga aattgttaag gaagtcagca cttacattaa gaaaattggc tacaacoccg 
acacagtanc atttgtgcca atttctggtt tggaatggtg acaacatgct ggagccaat 



<210> S23 
<211> 710 
<212> DNA 
<213> Homo sapiens 

<400> 523 



<4UU> 3^.3 

tnnncttcaa atcgntngct cttgttcttt ttgcaggatc ccatcgattc gaattcggca 
cgagagatta tgagcatgta gaagatgaaa cttttcctcc tttcccacct ccagcctctc 
cagagagaca agatggtgaa ggaactgagc ctgatgaaga gtcaggaaat ggagcacctg 
ttcctgtacc tccaaagaga acagttaaaa gaaatatacc caagctggat gctcagagat 
taatttcaga gagaggactt ccagccttaa ggcatgtatt tgataaggca aaattcaaag 
gtaaaggtca tgaggctgaa gacttgaaga tgctaatcag acacatggag cactgggcac 
ataggctatt ccctaaactg cagtttgagg attttattga cagagttgaa tacctgggaa 
gtaaaaagga agttcagacc tgtttaaaac gaattcgact tgatctccct attttacatg 
aaaattttgt tagcaataat gatgaagttg cggagaataa tgaacatgat gtcacttcta 
ctgaattaga tccctttctg acaaacttat ctgaaagtga gatgtttgct tctgagttaa 
gtagaagcct aacagaagag caacaacaaa gaaattgaga gaaataaaca ctggccttgg 
aaagaaggca ggcaaagctg ctgagtaata gtcagaccct aggaaatgat 



<210> 524 
<211> 730 
<212> DNA 
<213> Homo sapiens 



<400> 524 

ttnnnnnttt aancnttcaa atcnctaggc tacttgttct ttttgcagga tcccatcgat 
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tcgaattcgg cacgagccca cactcggaca ctgtggaatt ctaccagcgc ctgtcgaccg 
TaZT CttCatCttC ^ctatctgg agggcactaa ggcacagtat ctlgcagccl 
aggccctaaa gaagcagtca tggcgattcc acaccaagta catgatgtgg ttccagaggc 
acgaggagcc caagaccatc actgacgagt ttgagcaggg cacctacatc tactttgalt 
Traill 9 ? " gCCagC9g a ^aggaag gcttcacctt tgagtaccgc tacctggagg 
« <~agtgacac cggcccctnc ctdtacccac ccccttcccc cgcatgctga 
Jcatccccct ! aggtaa9gS «**gccctg gaagactgga gggaggcccc aagccacggg 
gcatccccct ctcocaggaa gcagggaggg ggccgggagg ttttcctctc aagccccacc 
ctgggggccc gggggcgagg gctgccccct cctcccctcc ccagtgaggg acattttttcr 
""""" 99— *t.tt tatgaataaa tag^tlS tgaaaaaaat 
attal^ag ancn tcgnncctct taaaactata gtgaagtcgt 



<210> 525 
<211> 711 
<212> DNA 
<213> Homo sapiens 



<400> 525 

gcngntnttn antttcaaat cgctnggcta cttgttcttt ttgcaggatc ccatcgattc 
gaattcggca cgaggataaa tacctcagcc cctcgccttc ctcaacJcac ctggJagtc 
ttottaggat ctgatcccag ttttctggaa gcaatcctac cccagcccaa gcttcccaga 
gtcgagcctt aatccttctc acttctcagt gtcagagcag aaatgaatcc tggggttgac 
tgtgtccatt cgggttatta gcagctaaga agcccagacg agtagtgtga gSgcct^gg 
gagcctcagt gagggcactg ggactggcct cactctcttg cccccagcct agtgggctt? 
ctcctctgtc tctccggtgg ccccaggcaa tcgactgcat cacgcaigga cg^gttgg 
agcggccacg tgcctgccca ccagaggtct acgccatcat gcggggctgc tggcagcggg 
T c tZ Tt a ! gCCaC39c -tcaaggatg tgcacgcccg gctgcaagSc c^ggcccagg 
cacctnctgt ctacctggat gtcctgggct agggggccgg ccaggggctg ggagtggtta 
gcccggaata ctggggcctg ccttagcatc ccccatagct tcoacfgccc cagggStc 
tcaaagtatc taattcacct taacatgtgg gaagggacag gtggggcttg g 9 " t9atC 



<2X0> 526 
<211> 692 
<212> DNA 
<213> Homo sapiens 



<400> 526 

tacangctac ttgttctttt tgcaggatcc catcgattcg aattcggcac gagagaacaa 
g9aag !f gga ******** ccagaagaga acagggcgga ctctcagg^c 
gaaaagagtc aaaccttttt gggaaaatca gaggaagtaa ctggaaagca agaagaSat 
ggtataaagg agaaaggggt cccagtcagc gggcaggagg cgaaagagcc agagagttgg 
ggC ^ ggc -Stgggaaga gcgaggagca gggaa|alga gLtgagcIt 
Itllir I CaatgCCC ^ c tctgatagcc cctgaggact ctcctcactg tgacctgttt 
aaaoS C cat ^ctcgt gactcagatt cccgggactc agacagagtc cagggctgag 
Sttctc^ : C9C TT gtCtCCCttg ^agagccca tcagatgctc t Jccagccc 
t tgggCtCCtt "tgactgag gagtcacctg acaaggaaaa acttctatca 
gtactttgat atgtcacagt ttcatgttta tccagttcaa tgtattttta aatttttcct 

£SS T "f Ctgata 9«tattgtg aatgtgtttt taaatttcca aatgtttang 
gattttcata tctttcttat gctgatttcc aa g 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

730 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
711 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
692 



145 



BNSDOCID: <WO 9933982A2J_> 



PCT/US98/27610 
WO 99/33982 



<210> 527 
<211> 769 
<212> DNA 
<213> Homo sapiens 



<400> 527 

gttctngttc tttttgcagg atccctcgat tcgaattcgg cacgaggcca agcctcggcc 
tccactgcac ctgctgcgga gtgggcacct ttgcctgcaa ggccttttnc ccantgncca 
atggtanttt aaccagggtt tttgncnntt aaggaggcct tngtggtggg tngttaatct 
ggccnttccn tattgaaaag ctcctgttat tgtccacaga ccagaaggac ttgtaacctt 
ggtcccacag tctgacttng gcttttcaag cacccagaaa acttagaggg aatcttatag 
attccagaac ttaaggatac ctcaagggat agggtcacag ccaagaagtn caaaggaatc 
ttcagtctgg aacaaaaaca gaaccctttc atgattgaca aangtcactt tctgtttgcc 
tggaccaagc tactncagat catctgacca actcttaaaa atcacggcca ggcacagtgg 
ctcatgcotg taatcccagc actttgggaa gcaaaagtgg caggatcatt ncagcccaag 
agttcaagac cagcctgggc aacacagtga gtgagaccct gctctattta agaaaaatna 
ttaagaaatt tattaaaaaa gaagaatcag gaaaccaagt ncaacccaac ttaacctcaa 
tgaaccagcc cctaacacag atgangggat ttgggactga taagctctgt gctgngtcca 
tggcccgtca nttatcaagg ttgcactttt aaatgnggta tttttatgn 



<210> 528 
<211> 757 
<212> DNA 

<213> Homo sapiens 



<400> 528 

tnaatatcag ctcttgttct ttttgcagga tccctcgatt cgoangaggg tgttcgactg 
ctngagccna gcgaancgat gcctaaatca anggaacttg nttcttcaag ctcttctgge 
ngngattctg acagtgaggt tgacananag ntaancagga aaaacaagtn gctccagaaa 
ancctgtaca gaaacataag acaggfcgana cttcgagagc cctgtcatct tctaaacaga 
gcagcatcng cagagatnat nacatgtntc atattgggaa aatgaggoac gttantgttc 
gcnattttaa aggcaaagtg ctaattgata ttanagaata ttgnatggat cctgaaggtg 
aaatgaaacc aggaagaaaa ggtatttctt taaatccana acantggagc cagctgaang 
aacagattct gacattgatg atgcagtaag aaactgtgaa attcgagcca tataaataaa 
acctgtactg tctagttgnt ntaatctgtc tttttacatt ggcttttgtt ™^aatgt 
tctccangct attgtatgtt tggattgcag angaatttgn angatgaata cttnntttta 
atgngcatta ttaaaaatat tgagtgaagc tnatngtcaa ctttattaag gattactttg 
ctgccaccao ctagtgtcaa ataaaatcaa gtaatacaat cttaataaac ntttaaacta 
taaaaactcg acccttagac ctatantnag tcggttn 



<210> 529 
<211> 821 
<212> DNA 

<213> Homo sapiens 
<400> 529 



tnanimannc annnnnnnnn nnnnntttga agccattgct acttgttctt tttgcaggat 
cccatcgatt cgaattcggc acgagagcaa ttccactcct agctccaccc acaggaaatt 
gaaagcaaag acgcaaacag atgcctgtgc accaaagttc acgggcaagc atccttcggc 
cttaatgggc agcattccgt cgtcacaagc gggcattcat cctttcatca atagcgggca 
gcattccgtc gtcacaagcg ggcagcattc ctttcgccac aagcgggcag catcttgtcc 
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!^^ 9Cg ^cagcatcc ttcgccaaag cgggcaagca tccttcgtca tagcggcagc 
atcctttgcc atagcgggca aggtggaaac cctgtccatc cactgaggcg tgcaSaact 
aaacatggcc agtccaggca ctggaatcca ggcccgtaga acggcgcccf cggtcataS 
gaatgagacc ctgatgcact gggcgacaca gacgggcgac acfjacttgg aglcatcaS 
aaataaf aa a ^ Cagg ° aC a <=^agcgga cggcgtgatc ctgctcacgt gltgtgtccc 
gaatgggcac gttcagaggg aagaagggag atggcgcttg ccggtgcccg gggLngggg 
ttgggagcga cggttgctgg tttggggttt ctttctgggg tgangaantg gStaatft 
ttggnccgtt ggtgatgttt gcatacctct gaatatgcl? afganc'acf gaatScca 
ctttaaatgg atgaattgna tggtattggg aattacccaa n 9aattgacca 



<210> 530 
<211> 765 
<212> DNA 
<213> Homo sapiens 



<400> 530 

gnntttnnnn nnnnnnnttt tatnnntaca gctacttgtt cttttta^ 
^ccgaaccc ggcacgagac taccccggct acggttoccc catgcctggc agcttoaccl 
tgggcccggt cacgaacaaa acgggcctgg acgcctcgcc cttgcccgca gataScct 
cccggt^ct lllaT^ " CCC *~ at 9-ctccttt aagaaagacg Icggcttcag 

IF = =s= s=s s~ 

S5 ~ ~= = = = 

gcctccggtt tccactactg tgtagactcc tgcttcttca agcacctgca gattctaatt 
aaattSta tT*?** ^tgcaggga agtcttactt iTsZltll 

Itlllt 9 9 39tgactc ^ tgtaaaacca tgtagtttaa cagaaccaga nggttgactl 
ttgttaaaaa caggaaaaaa ataatgtaag gtctgttgta aatgaccaan aSaaaaaaa 
aaactcngcc tntaaactnt tntgagtcgt nttcgtaaat ccaan aaaaaaa *** 



<210> 531 
<211> 768 
<212> DNA 
<213> Homo sapiens 

<400> 531 



gnntttnnnn nnnnnnnttt taagntactg ctacttgttc tttttgcagg atcccatcqa 

tcS» s sse; ™? 9 c ° s:™ tttSSS 
sees nbs 4 ssss ssss 

gctataSta taaa^^ agCagtagaa c ^aga gg ag ttggtgccat ggaaata|t? 
gctatggata tgaagcttag aggaatgtac attgctcgac aactcraactt Sr^L 
accttcaaan ttgaggaagt tcttctttct cagagctaca ^ Ctggagtg 
gtcaagctgt nggtcattgn cagagagccg gntcagc^ ctg^ct 
gancaacgaa tgaagaagtn catgtggggt cagttctggc tgLaccaga ggttcttSa 
atacttatgc atagcatcca aagttaaaag ggttgtgcac tagctcgaga SaaatcaJJ 
JaaS^n f gtngt " t ^"^agt ctcaggagaa gc^nnaacat Jag^ctttn 

?tgg::aaca SSSS^ f tganttg9 ttCaaCtgCC ~^tgtg cantcactca 
ccggaaaaca tttnctgctc cagcngggaa aacttatggt tacttggn 
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<211> 761 
<212> DNA 
<213> Homo sapiens 



<400> 
cgtnttttnn 
ttcgaattcg 
tgagccacct 
tgctgtacac 
ctggtattct 
agagtgttgc 
aaaataactc 
caaaaaaacc 
aacttattat 
attataccca 
ttgagggaag 
cacattttan 
gtgtaaaata 



532 
nnccnannga 
gcacgaggat 
tgcccaccca 
acacatatat 
gggttctgaa 
tctggtattc 
ctactacatt 
aagcaagcat 
aaatggaaag 
aagttctttt 
taccaaggca 
gggcattggg 
tctttagtng 



aagcccttgg 

cagcccacct 

catcatacag 

cactgaacct 

gtctggtatt 

tgagagttgc 

gaaatgcaga 

aaaaggtcaa 

aaagtttatt 

aaaaaatatt 

gcttttttcc 

taatcctgag 

gattgaaggc 



ctacttgntc 

cggcctcaca 

ttgaaatgaa 

ggttgaaata 

ctggtattct 

tctgtattct 

cttaaaaatt 

taagttgtaa 

tccttttttg 

tccatcacca 

tcaaaagtac 

attttactca 

cttgnggggg 



tttttgcagg 
aagtgntggg 
actttgccac 
aagntttttt 
gggttcaaaa 
gggttctgaa 
taaacattgg 
tcttgatagt 
gttgatgggc 
tttttattta 
ctggtcctct 
ntaaatcctg 



atcccatcga 
attacaggcg 
aaccagcctt 
tctttttcct 
gtatgacttg 
gattatttga 
attaggcagt 
aaaggtggaa 
agtatgccat 
aaataaacat 
ttgggaatag 
atggtactgg 



<210> 533 
<211> 735 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
761 



<400> 
taaacatcng 
cactgtccca 
ctccagttcc 
aggtgtgtgc 
gccggtctca 
ttacatatgt 
aaggtataat 
ttcatcaact 
catagttttg 
aggtgatctt 
atataaaagc 
ccctttnctt 
aaaaataact 



533 
gctacttgtt 
ctccatcacc 
tgggttcaag 
catcacacct 
aactcctgag 
cggctaccgt 
gaggtggtac 
aagatgcact 
ttagcaatac 
agatttgatg 
aagaagtatt 
taaacacgat 
tgagc 



ctttttgcag 
caggctggag 
ccatccctcc 
ggctttacat 
ctcaagtgat 
gtctggccgt 
ttatcatttt 
gtaacatctc 
actgtctttt 
aagcacagta 
ttttttttgt 
tntntcactt 



ggatcccatc 

tccagtggtg 

tgcctcagcc 

ttttctgtgg 

cctctgcctc 

tcacatcttt 

tactgngtct 

tgaaatctgg 

agtggtgcct 

tgcaggtagg 

aatgactgaa 

ncaactncaa 



gattcgaatt 
tgatcatagc 
tccccagtag 
ggtcttacta 
agcctccaga 
ggccactatt 
catgttttgt 
atatattatc 
aaaataatgg 
cctaatgggg 
agctgtctgt 
acttgctcaa 



cggcacgaga 
tcgctgcatc 
ctggaactac 
tgttgcccag 
gtatctggga 
tgcttgtgaa 
atatttttgt 
aatggtttat 
tatagttgtg 
gaagatggta 
ggatgaccta 
ctaatncttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
735 



<210> 534 
<211> 735 
<212> DNA 
<213> Homo sapiens 



<400> 
natngnttgc 
attcatacat 
agataatgtc 
aactagaccc 
agacctcaaa 
gtgggcaaag 



534 
tcctngttct 
ctatggtgac 
tttaataaat 
ccatctctta 
ctatgaaaca 
atttcttgtg 



ttttgcagga 
cacttttgac 
ggtgctggga 
gcatatacaa 
gctaaaagaa 
taatacctga 



tcccatcgat 
aaaggaatga 
aaactggntn 
aaatcaaaat 
aacatcgggg 
caaacaggca 



tcgagacaac 
agaacataca 
tccantntgc 
taattaaaaa 
aatctctcca 
accaaagcaa 



ccagaaacaa 
ctgggggaaa 
agaagaatga 
gttaaatcta 
ggacattgga 
aagtggacaa 
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cacccal:;: T^lTtl lTtT Ctg Catt9Caa ^ ^aataacaa agtgaagaga 
gtctacaaag aaat^ gactg^a" TtltlT^ tgtattaggc catttttgaa 
gttttgcagg ctatcaeraL ™ ? ! tttataaaga agaggtttaa ttggctcacg 
ggaagttt^c accSggtg Lt Ta^T^ 99WtJ 

gcaaaaanan gggggaagg* Seagate HT^" 

ggngggngct tgngg caaccagato ttgtgagtnc tcagatttgn 



<210> 535 
<211> 735 
<212> DNA 
<213> Homo sapiens 

<400> 535 



SSSSZ SSSS SST- ttC9Mtt09 3C ™= 

raw t S tc t3 cSt ssss ssss ais:?"" ■«*■'■—« 

attoraranrr ^ _^ 3 3 y uy tcuga attgctgagg aqatataaoa 

gcccgaccal ctac^tc c^cct^c 0^"° agCtgac ~t gaactcctga 
agatcactgc catcatctct cagc^g SS"^ 0 tgCaaggCgc 

ctgtcgctcc ctttgtttocr ttooahSS aCatattt ^ t ttttgacctg gagacctcag 
gtttcctcat gactlagaS acfcgafcS SSStS ^f^" ^gacaatg 
agaatgagtt ggagcaatct tttcJtgtga" cctccttaac ITttllT * f CCaCCagca 
taggttgtat tttcagtgqa caat^LL cccccctaac agatatttac tgaaggaatc 
aaggcaacag aaacaJgSg ag^cccga g^t^ ^T^ 9 act ^gg 
ancgattgct cttgtgaagl g?tS£2E gafcatS tgCgtgctgc cacgcatctg 
onatgtnctn aacan ^ccgtcact gaacattttc aggggaggct gtttacccag 



<210> 536 
<211> 785 
<212> DNA 

<213> Homo sapiens 
<400> 536 



S= ££££ tittz™ — — 

ssr Se— ss£ ?£i : ~ »~ 

atgntgccnc ctctntttat ^Stgatta III! t n ^o 99ag tttaggaaga 
gcnnttttnc tgctntgngg ncaS^ 2H atnnggggac tataagaaaa 

gtntgcctgc accaccS? gcc'cc?" atl^tcc ff^ 3 <*eet.tgct 
ttnncacatt gcctnattna tactnacnto ctn"*^ ntggatatgn atgccagggc 
aaacantggg tgcttgntto toat«S2l c tnatgacca anacatncac gtgataacac 
•atntteS gtgtn^ttn agttSataf a9gnganctn "ggnnngnt ggagnactna 

=£25 Sss S F : ™ = s= 
^-sn =~ ~ = ~ f 
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<210> 537 
<211> 967 
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<212> DNA 

<213> Homo sapiens 
<400> 537 



agtanggcgn ttcctaatnn annnggctaa gcgactttna aagangaggc tngcgtgntg 
aataccgnnc gaggggggat nacaatagta nacnnggtnc caatncatgc ttaacaccgc 
atntctttac cccccnannn ncacanatgc agacncacac atngcanncg nacacncaga 
cacacacang caagcactnn catgcatggc ccatgctcac acacntgnan nnaacatgcn 
dtagacatnt nagacacgtc atgtnacaca tgnnacacan gnnnaanaca ctgctttnca 
ngcanacnca gacggcacnn ngagacanac atgcnnaaac aacatgctcn ctcacntnna 
nncgntgggc cngtagtagt gtactgtggg tgnnactggg tgccatcnac nnngtatttt 
acgnnctttn aactaaaaan cttggagcct tnanttnntn tggtgantnc aatncctana 
antnncttga gngggatgaa ccctaananc ctggccctnn tnccnctttc aaggccnagn 
aattganatt attncntant ngnncacgaa gcttntggta ncangngncc cgagnnctnt 
tnaaanttnn ctnttttnan aatnaaacat tttancggtt ctnaggancc gngcctncng 
ggtanggann naattgtncc tgggnatagt tctcacaant natnttnaag gggnnaagng 
atnngngngg nccntntatg nggcnngcca annaangggg tcgnngttaa natattccaa 
gntaacanan gnacnatggn accnatccct ntnngaagna aggaactncc tgnncgacta 
nnnactatgn naaatattct cacatntaca naaaaagnag gnnccnnggt ncttnaagnt 
tntgcatagn nactatncnt gggacgngtt aacnnanatt ntatgcttta nnngatnggg 
gcttnnn 



<210> 538 
<211> 892 
<212> DNA 
<213> Homo sapiens 



<400> 538 

gctagttnga agaggtgttt ctaangnntn ggaatcgaca tctnnnnagg cngnccntgc 
gattcgcttt gctctctcca ttccaagttg ttctctgttc tagaaagcng atgnngggnt 
acatctactg tttttgccta aacagaatcc ctttntcctt tttttgttaa aaggctcatn 
cctaatatta cattgctctg gaacgantga caataccana actcagcacc ntgatcggac 
cgggacaatc agattatcta attcctcagc aaacggagat cgatccgaaa agtggaaata 
tganctcntn ctttgtgntg gcatatggac cctgagagaa agaaacttta atcttttact 
cttggactgc aatnaagtnt agctgcctaa aaatcnnttt cntgacactt ngnaggtttg 
tccacaatcg ggngaaatta nngggtnnga cntaancact ggatgaaaaa aaatnccgnt 
tanttntatt ncnnttccan ncttntnaaa tanananttt ntcanccttn nntaatacta 
ttanntatat ntnttnnncc cnnatnnncc ttcttnctcc tacnncnntn cnatntnnnn 
nnangntcnn cnarmnnttc tnttatttct annatatntc ntancnttna ctaaaacctc 
cnctcgtnna nattncnnta taatattntc tctaganntt ntnntntntt gnnncttaaa 
anctcntcta tccctantat nantnattct taccatnaaa tacactanaa gtnntntcac 
gagacncgnt atgttantnc anactataat cgcttncatn tanntatatn taaaantgct 
atncagnnag nngntnttat atntttanct ngnnaggnta tcctcnatan cc 
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<210> 539 
<211> 751 
<212> DMA 
<213> Homo sapiens 



<400> 539 

gnnnaggttn tagancagct cttgttcntt gngcaggatc cctcgattcg aattcggcac 
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gagagtgtca gttttcctaa tctcagtcca ggtaggaatt aagaaatatc tcaagtgttg 
atgctatcca agcatgttgg ggtggaaggg aattggtgcc cagaaaatgg gactLaeS 
aggaatatct tttcttttga gagtaccccc agtttatttc fctgtgeS ItttgcTcl 
aatZt * ^tgaatgttg taacatttta aaaatgtttt gccaLgctt tttaggactt 
fSatS 9 9agCCa ^ 99 tetctctggg tgggtactat aatgagttat tgtgfcccac 
agctgtgtgg gaccacatca cttgttaata acacaacctt taaagtaacc catcttccaa 
gggggttcct tcatgttgcc actccttttt aaggacaaac tcaggcaagg a^catgttt? 
aST^ aCaaaatCta 9e«g.ctgtg ggtatccata tttSattJ? cggg^acac 
atgttettgg taactaaact caaatatgtc ttttctcata tatgttgctg aKgttttaa 
taaatgtcaa agttctcctg ttaaaaaaaa aaaaaaaaaa actcgancc? ntfnactata 

IIIXIT att ! Cgta9a ^-^catg atnagatcat tgatgaattt gg^accc 
aactagaatg cagtgaaaaa aatgcttttn t 



<210> 540 
<211> 761 
<212> DNA 
<213> Homo sapiens 



<400> 540 

gntnggntcn agancagcta cttgttcttt tgcaggatcc ctcgattcga attcggcaco 
agcctgcagc cactaatgca ttgtgtatga taacaaaaac tctggtatga cacSttct 
gtgatcattg ttaattagtg acatagtaac atctgtagca gct^ttagt alacctcata 
tgggggtggg gtgggggtgt attccttggg ggatggtttg ggccgaatgg gglgtggaa? 
atttgacatt tttcctgttt taaattctag gatagattt? ScaLctS Icggtcccag 
Sttftatc i:t gg T " tagtCttCtC -tcctaatc catgaccact JtStSS 
imitl I acca ^ ta 9 c ccactgagtt aatatttaag ttgtcaatag ataagtgtcc 
llttll 9 9 gcataatata ^tgaatttc atgagaagat ttattccacc aggggtattt 
ItllttT^ otllT tgtatCtaat -taaccaat ctgttggatg tlggSttaa 
aaaatgtttg ctaactaccc aagtnagatt tactggatta aatggccctt cgggtctgaa 

gcctcclaaa ItT^" ccgtttaatt ttgataagat ncSaaa 

gcctccaaaa gtgttananc caatcatttn aaataaaccn ggntgtatat tgcatnatat 
gtacatgcnt atncccttct ggttaaaact naaaaaaaaa I ^gcatnatgt 



<210> 541 
<211> 748 
<212> DNA 
<213> Homo sapiens 



<400> 541 

ggtttanttt aaatccntnc ncagctactt gttctttttg caggatccca tcgattcgaa 
acc 9 S^ Cga gC9gaCCCat c ^gcgtaa cctggatctc cgcaggcctg gcggaggccg 
gocacctgga ggggcattgc ttggttcgcg tggtancaga ggagcLgag aatgtJcgca 
tcttaccaca tacagttctt tacatggctg attcagaaac Stcattagt ctggaagagt 
JaS 9 r Ca taagagagca ^aaagaa ctagtatgga aacagcac?t gc^ttgaga 
T CaaaCaat 3 c -agtccttg ggattgtgac cccaggaatt gtagtgSc 
taS 9 ?r a " gtag ° aat ^acctcagg aaatagaaat tggagaatct ggt?t?gctt 
tattattccc ttcaaattga aggaataaaa atncaaccct ttcattttat taaggatcca 
ttlTattat Cattagaaag -atnaactt actgaagtag gtcttttaga taccctgaac 
gcafgSS S 9 gttataa " g ct ^aaggtg ggagccagta attatc?gca 
gcaagtagtc acncttttca gtgatatgaa tatcatcttt ggcttggang ccantnoaca 
acctgncatt actgactttt tgaaaanaac cctctggata Kgatglctc gggtg^gtt 
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ggactgncat ttagtggacc ccgaatcc 

<210> 542 
<211> 784 
<212> DNA 
<213> Homo sapiens 

<400> 542 

gtnnnnntng tgtaatcgct tggctgcagg atccctcgat ggcgaattcg ^acgaggtg 
ttgctcaang agcagacccg actccntaag gtcatcattg aatgggcatn atangtttga 
anactgtcca ananantang ngtcaataca tcaacnnctt tanntgcttg atattgnnat 
tgaanaacac angnctcngn ctagttcgcc tganatgatg tttaagatac tccggaagga 
gacanantgt tntgantgcg gattaganac cacngaagnn acactnaagg ancancatct 
ccacctngna actgnattnn cngaccanaa aagngaactg gaccaaatgc tctcaaaggt 
gctggcagct taanagcgtg ttangactct gcacgaagan gacaggtnnt ntgagagcct 
ggnnannaca ctctcccaaa ctaaactgna nctttcaaca nangggancc ccannttggt 
ggagaaatca ggtganctgt tggcccttcc acaaagangc aaattctntg agggcnagac 
ttninccttt ttgcngaacc agtncttgac tgactaaatg aaagcttttt aagccaggtg 
gcccancctt aangaagcna ctttttaatc cancggaacc ngcttgagan aaaaccnttt 
ttgacccaaa accnggagaa ccagctggcc taccaaaggg aaatgggccc ccatttgaac 
ttggggttnc ccangaacaa nccttgnccg ggncaaagcc cnttgttgga aaggacctca 
acct 

<210> 543 
<211> 764 
<212> DNA 
<213> Homo sapiens 

<400> 543 

ntantaaatc ccttgctctt gttctttntg caggatccca tcgattcgaa tncggcacga 
ggacccggcg gcgcggacag gcttgctgct tcctcctcct nngactcacc attncaganc 
SaannSaa aaaatggnng anctcaccca ggtaanggat gatgaagtnt tnatggctnn 
tgcatactat gcannanttn tncttntgna aatgatgcnt atgagtactg taanngnntt 
ctatncattg ncaagaangg ntnttgncaa tncatangac tgtgtagcat tcggcanagg 
agaaaatgnc aagaactatc ttcgaacaga tgacanagtg taacgggtac ^agagncca 
cctgaatgac cttgaaaata tnattccatt ncttgnaatt ggcatnctgt attccttgag 
tggtcccgac ccctctacag cnntcctgta ctttagacta tntgtcggag ^ncggntcta 
ccacaccatg tgcatatttg acaccccttt cnnatccaaa tatagctatg ^tttttttn 
gtaggatatg gannactctt tccatggctt acacgntgcn gtaaagtaaa ttggccctgt 
gcagaaaaac attccactca gtnttccaan tggcttntta aggaattctn gaccttgcaa 
ttnatantgg agnnctttcc ttaagattta aaggtttgan ggngagccnn aggaattntn 
aaccnggggt aaaccctttt tggaattttn agcnttgnca anaa 

<210> 544 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 544 

gatgctggnt ncnnatgctt gnngatccct cgattcgaat tcggcacgag gaaatgtgta 
tttcagtgac aatttcgtgg tctttttaga ggtatattcc aaaatttcct tgtattttta 
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ggttatgcaa ctaataaaaa ctaccttaca ttaattaatt acagttttct acacatggta 180 

atacaggata tgctactgat ttaggaagtt tttaagttca tggtattctc ttgattccaa 240 

caaagtttga ttttctettg tattacattt tttatttttc aaattggatg ataatttctt 300 

ggaaacattt tttatgtttt agtaaacagt atttttttgn tgtttcLac tgaagtttac 360 

tgagagatcc atcaaattga acaatctgtt gtaatttaaa attttggcca c?tt?t:tc«g 420 

attttacatc attcttgctg aacttcaact tgaaattgtn ttttnttttc tttttggatj 480 

tgaaggtgaa cattcctgat ttttgctgat gtgaaaaagc cttggtattt tacattttg! 540 

aaattcaaag aagcttaata taaaaggttg cattctctca ggaaaaagcc atcttcttgn 600 

atatgtcnta aatgtatttt tgncctcata taccggaaag ttcttaattg gattttacS 660 

gctgnaatgc tttganggtt ttaaaaataa taacattttt aataattttt taaaaggacl 720 

aactttcata atnatcccgg ngntcctttn ccnnn ™ 

<210> 545 
<211> 767 
<212> DNA 
<213> Homo sapiens 

<4G0;> 545 



agnttttnaa tcctttggcc antcgcnctt tntgcangat cccatcgatt cgaattcggc 
acgagaaaaa gtnaagcttt tcatgagcac anntnccttg cattgttnga tgttactgat 
attcgtaaaa tgaatatttt ctgttttgtt ctgttnnatt tttttgagac aagtcttgct 
ttgttgccca ggctggagtg caatggcatg atcttggctc actgnaaccc ctgccttgcg 
agttcaagtg attcttctgc ctnagnctcc tgagtagctg ggattacagg cgctcaccac 
cacacccagc taatttctgt cttttnagtn gacacagggt tttaccatgn tggccaggct 
ggtctcaaac tnctgacctg aaactnctca caccngtnat ctcagcactt tgggaggctg 
angtggaaag gatcacttga agccatgagt ttgagaccag cctgngcnac acagcngag! 
ccccngtgnt gtacaaaagc ttncnacatt tanctggctg aggagtnnct cacccntaac 
ttccancnan tcnnttaagc nnanncatnt tgaacacntg agcccannta nggtcgatgc 
tnntagtnaa ccgtgactgg accacttaca gtccaagccc gggtngcctt ataaaagaan 
cggaaaacat ttcnttaatt cgggttnnag cnttanctat ttcggaatnc cttgngtttt 
naaaaacttg aatctccaan aaacagggtt ttttcttttg gnccann 



<210> 546 
<211> 989 
<212> DNA 
<213> Homo sapiens 

<400> 546 



tncccttgtt gaaanccctt tgctcctttn tnctnccgtt tgncatncna ttcgctcagc 
tgaggcaatt aaactggaaa agaaatagat tgaaaagata ctntngaaga agcagtacag 
aagttggggg actgaaggag agggagccac tgcaggtgct agctgcttaa ggggatacca 
lilt tT° a ^tataata gatacagctt ctgaggtgga gggtgatagg agtgtgtatg 
9C "r" 1130 " CtgCtCnt * c ^cctnggca anaggannan cntttcnccn 
nttncnnccc ttatngnaca cacattgncc tgattggncn tnccncngct agcttncagt 
cttnantnta ctcannagnn nntnggggaa cncnctntcn nantatgntc ccttttcctc 
tnncntnncc nnatancacc ccnctcnctt tcctttctaa acttncacan ntccctgana 
atgncttccg aatggantct tngaatttct ncgcccctnc ntcntcataa tcnttttgct 
nctccngctc nccctcattt tnctacgtnc cnccttctnn ttnactgnct ttaaatntta 
ttancnncnt ntncnttncn atctncaant tttcnnnccn acnnnntttt nctnntnnca 
aatcgcgnna aataagtntt gcncactcnn ntnctancnt attntccctc gcnnttntcn 
tcatctcccg cnncactcac ntnnncnimt caattnntnn nnacncncnc tgctctacnn 
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ncnatntctn tncctncaca ccctntancn tntcnctcan aatgcctttt ctnccttann 
nctntcnttc ncnnatctan ccaantttnc tttnacatcc cctncnnntc tnncccgacn 
atatntnacc tcttnnatcn cagngcntan natcnccccn ttntcnctnt cnctctcann 
cttntnttna tcttcatnna tcanncncc 

<210> 547 
<211> 781 
<212> DNA 

<213> Homo sapiens 
<400> 547 

tqtnnetttn cnncctcnnc cgaaatcnct ttgnttctaa ctttcctaat tacctgggct 
acttgcacta tcccntcgat ncgcatagat ggccnngtta ctaanggtga ntttccagcg 
cggggggcac gtggagtcac tggaacattt gngcaatgct ggtgggaatg tcaacccgng 
cnggcctctg gaatangcct ggcnnntcct gcnagagtta ccntgtgacc cagcaattcc 
actcctagct ccacccacag gantngaaag cnaagacgca nacagatgcc tgngcnccaa 
anttcacggc agcatcctnc gccatantgg cancatccgt cgtnacagcg gcatcatcct 
tcatcattac ggcancatcc gtcgtaacag cggctacatc acttcgccac agnggcagca 
tctgtngtca cagnggcngc anccttngcc aaagcggcag cntccttcgt catagcggna 
ncatnctttg ccatancngc naggtggaaa ccctgnccat ccactgaggc ntncatanac 
tanncatggn cagtccaggg cactggaanc cangccgtng aacggcgccn acggtnanna 
ggaatganac cntgatgcnc tggggccana catactggct anacanactt ggagacatca 
tgcttanttg nannnccant cacacttgcn nncggcgtna tcctgctcac gtgatncgac 
ccgaatgggc acttcaaatg ggaanaaggg ngatggcact nccggtnncc tnganagggg 



n 



<210> 548 
<211> 735 
<212> DNA 
<213> Homo sapiens 

<400> 548 



<4UU> 3*0 

tctaaacgct tggnncttgc tctttctnca ngnanccnnt gcgntncgaa ttcggcacga 
tctagatatt gcccaatcgc tgcccacagt gcacatacct ttccaccagt cacatgtgag 
agggcagatt ttccaaatgc tcatcaccac ttggcactgt gtggactata attttggcca 
gttaggaaat ggcatctcat tgttttcatc ttaatttgcg tcagcctgat tactcattga 
aacttgtgag gttgagaaac ttttcttaag cttattggcc attcaagttt cctcctttat 
gaaatggttg ttcatgtcat ttgctcattt ttatattaga ttgtttttct tttttccagc 
tgacttgtag gaactctaca tcttatcaat attaatcatt tatcgaaaac tatttgggtg 
ccattatctt ctcctagtca atgttttttg tttgtgatat cttttataat atataagttt 
ttaatgttgg cagaagtaaa gttaatcttt ttggctgtgt tgtgtgtctt gtttgatgta 
aagatagttt ctgtaatagt tttgcagttt gattgntcat ctttaggtct tcaattcaac 
ctgcacatcc atcccctcta tcctctttct tactctgttt ttctccatac cacttatcat 
ccaataatat ggtcatgccc tttattnacc ngntttgcat atataatttg gcttgtnccc 
ggttccttcc ctana 



<210> 549 
<211> 812 
<212> DNA 
<213> Homo sapiens 
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<400> 549 

ttctaatact tggctetngt tctttcngca ggatcccatc gattcgaatt cggcacgagg 
ggaaggagcg ggcgtgaggc cagctgaggc atggtgaccc ctgggaagga gcgggcgtS 
XaacSf ggCat9gC9a ggancgggcg tglggccgc ?tSSS 

gtgacccctg ggaaggancg gncgtgaggc cagctgaggc atggtgaccc ctgggLcgg 
gggacttggg ggccgacctt ggtttgccca gggcccctnc tgcaccacgg ccacatgcgg 
aggacggcgt tgggatangc tccctgggtc cacagcttct gcccgtgtat tggggafccc 
tncttggtca aggcttcang ctcttggcag atggggcaag gaaccctgag gcttccgcgc 
ccttccatgg nctctgatgt gggacacttg aacgangcac gattctgLg gactccftgg 
Stto^r 11 gattangCCC a «"cngtt ggtggncnaa agccgtcctt Lggggcccg 
HIT, !" 9gaCaaCt "tcnggtcg ggcttgttgg gccccaatcn ttiggttggg 

naanttcncc ttaaaccttg ggcccgncct tttaaccttt tttcccaatc ttKgaccTt 
tttccaaaaa ggggtncccc tgggcttttt ngggncaatt ggttccgggg gccaaaggtt 720 
gggaaaaaat gccttncatt gggnaaaacc ctggatccct tgttaancct ^ggg^tt 780 
aaaatggaat gaattttccc cccgggcttt tt aggagntc 78 o 

8X2 



<210> 550 

<212> DNA 

<213> Homo sapiens 

<400> 550 



<210> 551 
<211> 736 
<212> DNA 
<213> Homo sapiens 



<400> 551 

taaaccttS tlT^ ttttCtaatg <*nggcgatt cgtcctggtg tcaaacacta 
taaacctttg accagctgag ctgtgactgg ctgtcacntn tctgagtcct gtgtgcacag 
a^S C ^ ggtCaggtaa aatccaggtn ttcaagtttt aaggnttttt tgLLattc 
gggcttnttt aanacgatcc ntgcccaant ccacaagctt gttgacagtg gnttacaatt 
gltaaSJ T^T^ ***«*tgn gctttaaaaa LaLttatc Scatgtatt 
atto^^ 39 aga ^ atgag ^ctatttat caggaccagg aagatncaca cttcaggtcc 
attgcaactg acttttttct tgtttttctt aaaaccctgg tggagcctgg gaagggggcc 
ttacc" tgtggct "* atattagccc caattttaca agcacataca agc^cSaa 
ttgccgcagg aaaacacaag atggaaaatg caataaccca tgcactgaga c^tagaaaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



ggnnantcna tgctggtctt gtctctntct aaaagttggc nattcgaatt cggcacgagg 
t^TTJ 9g ! atg9tCt ^cctgctact ggagagatct cctgagantt cfgtttfggl 
ttggtgctgt catcttcctg ggaatgcttg anaaagctgt cttctntgcg gaatttcaga 
ntntccgntc caaaggagaa tntgtccagg gtgctttgat ccttgcaLg ctgctttca" 
cagtgaaacg ctnactggct cgaaccctgg catcatagtc agtctggga? atggcatcgt 
caagccacgc cttggagtca ctcttcataa ggttgtagta ncagg^ccc tcXtctttt 
gtnctctgca tggaaggggt cctcagagta ctgggtattt tncttatccc ttgactctga 
aS 3 ; Sgccctntca gcagtttgac gcctgggtat ttatggatat taattagcct 
? a !^ aaaCa at * aa 9 ct " taaaacttcg gaggaacatt gtaaaactct ctttgtatcg 
gcatttcacc aacacgctta tttggcagtg gcagcatcca ttgggttaat catctggacf 
cat^r 5 tCaanaatag t9*«tgtca ntcggactgg ccggnagctn ttgggtagac 660 

gc™t SSSE cT tcttcca t9ancccctt tggcaatcat gg ™ g ™ 
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catccttact aggcaaaatg tattatgatg caataagtgc cactgggnat tttnacgttg 600 
ggactggnca ggaactgctg caaagaaaaa taacagctcc ttctccatta tttacattta 
agatgttggt ggggggaagg ttgggagaaa ttagttctga gggtatcata tgcctttttt 



aaagaaaatg ggaata 



<210> 554 
<211> 766 
<212> DNA 
<213> Homo sapiens 
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<210> 552 
<211> 733 
<212> DNA 
<213> Homo sapiens 

<400> 552 

nagtttaann gtatgtcttg tcttttccaa gatcctatcc gattcgaatt cggcacgaga 
agtgtcagtt ttcctaatct cagtccaggt aggatttaaa aantntctca agtgttgatg 
ctntccaagc ntgttggggt ggaagggaat tggtgcccag aaaatgggac tggagtgagg 
aatatctttt cttttgagag tncccccagt taatttntnc tgtgcttnat tgctnctgtn 
ctttattgtg aatgttgtaa cattttaaaa atgttttgcc ntagcttttt aggacttggn 
gttaaaggag ccagtggtct ctctgggtgg gtnctataat gagttattgt gacccacagc 
ttgtgtggga ccacatcact tgttaataac acaaccttta aagtaaccca tcttccaggg 420 
gggttccttc atgttgccac tcctttttaa nggacaaact caggcaagga gcatgttttt AQn 
tngtnattta caaaatctan cagactgtgg gtatccatat ttnaattgtc gggtgacaca 
tgttcttggt aactaaactc aaatatgtct ttctcatata tgtgctgatg gttttaataa 
atgtcaaagt tctcctgtta aaaaaaaaaa aaaaaaaaac tcgagccttt anaactntnt 660 
gagtcgtnta cntagatccn gacatgataa gatcatgatg agtttggaca accncactng 720 
aagcagtgaa aaa 

<210> 553 
<211> 870 
<212> DNA 
<213> Homo sapiens 
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<400> 553 

nagttaanag taggtcttgt cttttgcaag atcntancga ttcgaattcg gcacgagtat 
ataacaactt ttgctttcaa agttgggtgg gactagancn cncantggaa ggntggagtc 
agganacctg gattnttgng cccgntntgg nttttacagt ntgcctaant ttntgcagtn 
acttcntgcc ancctgtttc nttacntnca anagggaaag acantccttg gccagcctag 
ttttnagggt gaacgaaagg tcnttntcac tgcntcctct agtcatttgc ttcttcgnta 300 
attaacacat cttgagcacc tgcnatgttc caggaacagg agatggcanc gtgcaagata 360 
aagtccctga cttctagaga ctgcatgtta gtggcaatcg gcgtntaccc ggccttnaat 420 
aaactactga atgaaggaaa attctaccta caccagacac aattactggg gtttctaaaa 480 
tggaattatt cccccggccc cntgcatcca gcagcctgnt gcagggaaac tcctccnaaa 540 
ggcttgtaag gcaaggaanc cgggacaatg gcntggctat ttaagcttnc aacaagatgg 600 
ttacccctaa gtncctaatt ccctaacacc aagggggccc tttaccagga aaccaaaacc 660 
aggttaaaaa accccaaagt tgggnaaaaa gccatttgcc anccggggcc nttttaaaaa 720 
aaacctttna aaaacctttc ccttttaaaa ctttaccttc aagntaaaan tttaagggga 780 
atgggnccaa nttttttaac canccccaaa aaaaanttng gnaatttttt ttcccnaaat 840 
tttttnaant tccccaaatt tnggaaaang 



870 
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<400> 554 

tatcaatgnt atgtntggtc tnttcgaaag anctagncgg ntcgaattcg gcacgagcca 60 

acacccagtt ctnactctgt catccaggct ggtgtgcagt ggtgcaatgt gggcttactg 120 

cagccttgac ctccaggaca agtgatctcc cacctnagcc tccggaatag ctgggactac 180 

agntcaacaa cgcccctctg aaagtaggac tcttggaaat gaaccttgtt gggagtaaag 240 

ctgaaccttc acctctcctt tccaggattc tactccattc atacggcctc acactgaatt 300 

aatgtttnta gcagccacat cacttngtta cccaattgat ctagtagtaa agtcttccca 360 

tctnttcatg taaaaaaaaa aannnaaaan gggnnaggaa ccntnangnt nnnaanaaaa 420 

aaaaaaanca ggnggngngc nttttttaac ctataacctg ntttnaggcc tttccccang 480 

tttntccnaa ncnnggttan taggggccna aagctaaccg natttttgnt cccntnaggt 540 

taggccngaa attaaccngg gtttaaagaa cncattgant aaagccttgc ctnggccaat 600 

tccgggaaaa gggaanagcc tccttgtttt acanattggg aaaaattggc cccaangggg 660 

gttaaccang tttgcccntt aataactnaa anggattttt gncaaaacct ggttccaagg 720 
ntttaanccc aanccttttn aaanntnggn cnctttggat gnaann 



<210> 555 
<211> 770 



<210> 556 
<211> 756 
<212> DNA 

<213> Homo sapiens 



766 



<213> Homo sapiens 
<400> 555 

gttatccnat gngcgtntgt ngnnnncnct aanananttg gctngncgct gggccttgct 
tctctgagaa aactttggtc acacntccaa agccagggtg ggtgcctccn tgnaggaggg 
ggctttcctg gttggtggcn cagnaggagt ccaggctttg taccgtggac accatgggct 
atggcaacac cttcctaacc atccttccat gaggacctcg gnaganagtg gacatgaaac 240 
cctttgtgct ctgaancatt caacagaagc tttctggttc tgtgcctatt tctttggcac 
ttgancgtgt ttgcaggttc attacncaca tgatgaaagc tctggcccat agcactagaa 
ttcatgtttt nagggtttgt gagtgtgaca ggtgctatgg tttggatgtg gtttgtttcc 
accaaaactc ttgcttgaag tttaactgcc agcatggcaa ttgttggnag gtggggccta 480 
ccgggaggtg attgggtcat gggggcttga accctccgga atagattacn gctgcctcct 54 0 

ganaaagttc tacctgtcat gggggctgga tcagtcaaca ttgannantg gggttgttat 600 
aaagcaagac tnactcctta tgcaccgttt ntttgcatat gcccctctgg gggnancttc 660 
tttggctgct aacatttttg gacccaaccc aatgggcctt nacccagaaa nccggaacaa 720 
aatgccnnnn gccattcctt tnngganctt tccaacttnc canaaataat 



60 
12 0 
180 



300 
360 
420 



770 



<400> 556 

gtngtcnatg anatgtcttg cctnncgaag aacnaggcgn ntcggtagaa cagaaaatga 60 
gcatccgatt tcttcactaa aggagaccaa actggttcct tgcggcctag tnttnaagan 120 
ctggancttg aaagtcctcc ttntaccaac tccacntcca ccccntnatt cccnttntcc 
caaagtncta ctgntgttgc ntgacanccc caaatntgtn ctgtcaacac aaacctgcct 
ttggngtata aacagggcnt tacagaatgg tncaccctat atatttctgt tcagtatcca JWU 
ttcactagtt cttcattaat aaatatcatc ttccccattc tgctgctgaa tgccacacat 360 
ccatccagtc tgagaaagtg agagaggcaa tcatgccaag aacaagccag caaagctctt 420 
tcaccagatg tagactgtag ccctgctgcc ttccctccag cgagtctgcc agcatgcttc 
ttcatccttt taatatgtcc tttgcttcct acttccctgn cttccaacat actgtcactt 
actctggcag tcttctgctt ttcattaagc ctcaaaatct cctctgtcta cttggcacca 



180 
240 
300 



480 
540 
600 
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caagctatgt cctatatatg natttctgga cttggcangg atagttcaag gggtcttggc 660 

aagtttttat ttaccttcat tatttaaaan gggccttttg gggatgttgg cctntttaag 720 

gagccttttt ggggaaatca atacttctct taanaa 756 

<210> 557 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 557 

tcgtcnaaan nnatgtcctg gctatccgca ggatccaggc ggntcgaatt cggcacgagt 60 

gatttttttg gttttttttt ttgntnttgn caaaagctta ntcntttcan ttaaaantgc 120 

cactantttg actttttaag taaaaantgt agggggtttt aaanctactt tcctnctncc 180 
aaaaantcag aaagtttcta nctttntaaa ttgggaaagc aagcantgtt ttaaaancac - 24 0 

tgaaggaatc tctttnttcg ngnccttttg ttaaactcgg tttaagctgt agaccttntt 300 

taaantaaaa tttaccacag aacaggaaat agaanctgtg gaagactcga aatacacctt 360 

tgtncttctc tgttcttcac ctgctctctc gctgtctcta cacacacaca cacaaacaca 420 

cacacaccta tatttgcatt aaaaatgggt agtaaaagca gtgaagggca aacagaaggt 480 

ccattncatc aagtaagagg ttgaatataa actggaccaa gtcttaattt tttatttcct 540 

tcattcggat ncgtttacta atttctttgc tagctttaag acttttaaaa cattctttgg 600 

ccctgggagg gagttgttta cccctaaact tggagaatcc tggccctaga ataaatgttc 660 

cttttaaacc cccanggccg gaaaattgaa tncngctgtg ccaaaaagga aaaaannnaa 720 

aaaaaaactc gnggcctnta na 742 

<210> 558 
<211> 730 
<212> DNA 
<213> Homo sapiens 

<400> 558 

gggtcnntaa tntnnagcnt gtnaaacccc tgagncttnc gggncgttca caaagaaaca 60 

tttaataggg acttncaanc aaataattnt cggtttntca ggtggcagca agacaagatg 120 

gtggatcccc atgccattac ctgctagact cagggttnat atactgtagt ggaaaggtga 180 

ttccgaagga atgttgtaag acaattgaag tgcagtanca tcaaagttat ttgacctaag 240 

ggcaggagtt ncagtaagta tccactttta tncaagaaac antagataaa ctggaaatct 300 

tggagccctt cctggaactg gggttaatga gaagtcaaca tggtggatta ncatggaaga 360 

tggagttgct tagtctccca ttcaagatgg agtttcttta gcctccattg atagggagtn 420 

tttaacaaaa ncangaaata agtctttgat ccattgaatc tctaagagtg agcccttgat 480 

gactcaggtt taaacagtnc tgagacaatt taggagatag ttttgaagnt caatttgaat 54 0 

tgtaaaaggt caggattttt taactttttc acatctttga anaaaagccc atagagcgca 600 

agttttcagc aaganctgga aancnatatt nctatggaat taaatagctc ctcagggcaa 660 

tcaattnggc ctggganaac ataatgcttc aanggctgan gnaatctgga atttctatgg 720 

gatttcttca 730 

<210> 559 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 559 

gttagtctat aangtnngnt atgtactngc cctttccgnn ggatcccntc gnttcgaatt 60 
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cggcacgaga ggaaacaccc ccttataaaa ccatcatntc aggctgggtg atctgacaga 120 

gctagacact gtcaaacaaa caaacaaaca aacaaaaaaa ccccatcaca tctcatgaga 180 

cttatttact atcatgagag cagctcagga aacacccact cccgtgattc agttacatcc 240 

cactgggtct gtcccacaaa ttgtgggagc tacaattcaa gatgaggttt gggtggggac 300 

acagccaaac cctatcacca tgtaaaataa tatctaattt gtagagatta aagaacaaga 360 

taacttaaat cttggatgta agttaagaga gtggtggtca gagttaaatc attttaaggt 420 

tcatttattg tctggacaag aataaaattt tgattatcag gaaatacaag taaaaccaca 480 

gggagacatt gnttatatcc aaattgtcaa aaattacaaa gtcttataat accaagtttt 540 

gctganggtg tggagcaaca gaaacttttg ttcactggtg ggtatataaa ttgaataatt 600 

tcagcttgga cattacctag caaaattgaa ggctgtatac gtacatacct accaatctag 660 

caattcactt ctagatatta agtcttgaaa aactcacatg tttccagaga cgtgttaaaa 720 
ggtggttaaa tcattntgng aat 



743 



<210> 560 
<211> 833 
<212> DNA 
<213> Homo sapiens 

<400> 560 

atccngttct ntannnngtc tngttctttc tncacgatcn nntgcgattc gaattcggca 60 

cgaggggtcc tggtgggagt tccatccagc agtgagtgca ttttttcccc agagcagtta 120 

agggtcttat taaaagccac cactttgctg aggcctgtac aggccttggg ggtttgggga 18O 

agagaantaa ggcaggcact tgtcccttca gggagggact tgtccntact gggaggtttg 240 

gggttgacct tggctccagc agagataccc agcctggcnt ggaagggcag gtcttgagct 300 

tacgcttgac tgcaagggca agctgcaggc ctcttctgcc ttcccctgca ttcaccaagg 360 

acaagtagga ccaagaagtc aagggaaaag tgccaagata gatctattcc catttctttc 420 

ttccacctgg agaattcctg agctatgctt caaacctctt ttgggccagg gaaagactgg 480 

gggacatttt ttagtcaagg atgctttaag aaagtaaatt cctgcttggg ggcccaggcc 54 0 

ttctttttca agggcttgct tgtgaatgcc caaccaaaaa aaaggggccc ccaaggccca 600 

atcccttact tcctnggtcc ccccaaaaag ggatnccaan ttggggaatt gggaaaactt 660 

gggcanncac ccnaanccca ctttggtagg anttnaccaa cccaaccaac ccaaaaccan 72 0 

cccacccaaa ttnaaaaaaa ggccaaaacc accaaccaac cnaaacccnn annnnnnnnn 780 

nannnnnnnn nnnaaaaaaa ctttgangcc ttttaaaaac tntttngngn ggn 833 

<210> 561 
<211> 773 
<212> DNA 
<213> Homo sapiens 



60 
120 



<400> 561 

tagtctaatg tnnnaaantn ngcnctngtt ctttctgcag gatcccatcg attcgaattc 
ggcacgagga agaggaggct gtgtatgagg aacctccaga gcaggagacc ttctacgagc 
agcccccact ggtgcagcag caaggtgctg gctctgagca cattgaccac cacattcagg 180 
gccaggggct cagtgggcaa gggctctgtg cccgtgccct gtacgactac caggcagccg 240 
acgacacaga gatctccttt gaccccgaga acctcatcac gggcatcgag gtgatcgacg 300 
aagctggtgg cgtggctatg gccggatggc cattttggca tgttccctgc caactacgtt 360 
ggagctcatt gagtgangct ganggcacat cttgcccttc cctctnaaca tggcttcctt 420 
attgctggaa gaagaagcct gggaattgac attcagcact cttncaggaa taggaccccc 
agtgangatg aagcctcagg gcttccttcc ggcttggcag actaacctgt caccccaaat 
gcagcaatgg cctggtgatt nccacacatn ctttcttgca ttcccccgac cttccagaca 
gctttggctc ttgcccctga caggatactt gagccnagcc cttgcctgtn ggccaaaccc 660 



480 
540 
600 
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tgaattgggc cacttgccaa acttgcnggg gaaagggttc cttgaaacaa gggggccatt 720 

tttggggaag gcttcttggc ttggcctttt ggcatttnaa tttggccttt ttt 773 

<210> 562 
<211> 655 
<212> DNA 
<213> Homo sapiens 

<400> 562 

nnatanacat taangnnaga ngntgagnan ttnccntcgc tctntganna naaggcgncg 60 

cgaattcggc acgaggccac cggtctcttc ctaatctgca cattntattt tgggtatttc 120 

tgggcgggca gttcctttgc atgtttcggg agaggtttgt tgatttgggg cttatatgtc 180 

aggcctttgg tttgcgtctt attttagggg ttgtttgggg gcctgggtgg tcggcctcac 24 0 

atgggaaggg gatgggtagt ggatggggtt tctgtcgnat cttgnggccg gtgattttgc 300 

tnncgnnctg tttcacattc ttccccctcc acaagccaaa tcgttcattt ggntncactg 360 

tgtggactgt ctgagcttgc cctgccagaa aaatttgggg ctaggcaccc aggtgcanac 420 

tttggaagaa gcantccacc tgtgggtacc gcatctcgtg ngtcccactg gcaggctgaa 480 

cctacttgaa catggaaaca gcatgcccat atggcaaagg ggccnnnacn nnngnnnaaa 540 

tnnannannn ncngacannc nncnnaatca ngannntcna cannnatcnn annnnancnn 600 

nncaantacn ncnaaaacac accnnccana annnnnaann nnnnnncann nnnac 655 

<210> 563 
<211> 738 
<212> DNA 

<213> Homo sapiens 
<400> 563 

tnntaatgct ggaattcctn atncttgggc tactcgttct ttctncagga tcccntgcga 60 

ttcgcagaaa agagtatagt aggggatgac caaggtcaaa gtgggtaaag aagactcatc 120 

atccactgag tttgtagaaa aacggagagc agctcttgaa aggtatcttc aaagaacagt 180 

aaaacatcca actttactac aggatcctga tttaaggcag ttcttggaaa gttcagagct 240 

gcctagagca gttaatacac aggctctgag tggagcagga atattgagga tggtgaacaa 300 

ggctgccgac gctgtcaaca aaatgacaat caagatgaat gaatcggatg catggtttga 360 

agaaaagcag cagcaatttg agaatctgga tcagcaactt aggaaacttc atgtcagtgt 420 

tgaagccttg gtctgtcata gaaaagaact ttcagccaac acagctgcct ttgctaaaag 480 

tgctgccatg ttaggtaatt ctgaggatca tactgcttta tctagagctt tgtctcaact 540 

tgcagaggtt gaggagaaga tagacccagt tccatcaaga acaagctttt gctgactttt 600 

atatgttttc agaactactt aatgactaca ttcgcttatt gctgcagtga aaagngtgtt 660 

tgccatcgat gaatgctgca gaaatgggaa gatctcaaat tctttgctca aaaacgtgaa 720 

cttaaccaaa atgatggt 738 

<210> 564 
<211> 798 
<212> DNA 

<213> Homo sapiens 
<400> 564 

nggggngtct aatgctgcnc nnatcnannc anggnnctcg ctctngctcn acnnanaagg 60 

cgntgngtgt gccaccacac ccagctcatt attattatta ttattattat tattttgaga 120 

cgaagtttca ctcttatccc ccaggctgga gtgcaatggt gcgatactgg ctcactgcaa 180 

cctctgcctc ctgggttcaa gcggttctcc tgccttggca ggcacctgta gtgtcagcta 240 
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ctcgaagctg aggtgggaga atcgcttgaa cctggggggc ggagattgca atggtgtggt 300 

ctcggctcac tgcactcgag cctggcgaca gagcaagact ctgtctcaaa aaaaaaaaaa 360 

aaaaaaactc gagccntnna actattngng aggtcgtatt acgtagatcc agacattgat 420 

aagatccatt gatgaagttt gggccaaacc ncaacttgaa tgcrmngaaa aaaagcttaa 480 

ttgggaaaat ttgggaatgc ctatngcttt atttggaacc ctttntaagc tgcaantaaa 540 

acaagttaan caccncccaa ttggcntcca ttttaatgtt tncagggttn aggggggaag 600 

gttttgggaa ggttttttna aattcncggg ccnnggggnc ccaatgcttt ggggccccgg 660 

gtncccaann ttttgggncc cttttaangg gngggnttan attggccccc cttgggggna 72 0 

aaancgnggn anatacctng gtcccctgtg nanaaatngg nttcccntta caaaatttcc 780 

cacnnanatt tnngnncc 7gQ 

<210> 565 
<211> 744 
<212> DNA 
<213> Homo sapiens 

<400> 565 

ttntiiiigtct naatnntcnn ggnntcgntc tnnctcnaan nanaataggt ttggcgaatt 60 

cggcacgagc atgctggcca gcatccctgc ctgtgcaagc tctggatgag ctgtgtgccc 120 

ctgccacnca cacccngcac tccctgccag cctggcctca gggcctctga tccatgtgca 180 

ctggagtggt gatgactgac agggccactg gggcatttnc acgttaacag cagctgccac 240 

tggcaaaaga agtgactcgc caatggtggc atctcagatg tgggcccagg agtctgggga 3 00 

gctactttga acagggctat ccattcattg tcccaccaaa ggctatggag cccacccacc 360 

atgtgctgga gtagtcaagg gaaataagac actctccttg tccttgttaa ctcaatcaac 420 

aagcatttgc agagcaccgc ctatatgccg gcgctgtccg aagtgctgaa gatacagcaa 480 

tgagctaagt aagcactgac ttcgtagaaa accataacat cggccatctt tggaaaagag 54 0 

aaaaacaatg gagttactta tttaaaaaaa aaagaaagaa agttatctct tccanganag 600 

gctagaagta cttttctgct ttttggccag tgcccantgg aatgcctggt ttgggggaag 660 

aagaagggac tgggttaact gtggtgcttt tgttgtaaaa aggcanctgg cctttgtact 720 

tgaggagaaa natggagcct tggg 744 

<210> 566 
<211> 756 
<212> DNA 
<213> Homo sapiens 



60 



<400> 566 

gnagtnntat tgatttntct ccgtgaatcg ttctnnctnn annanaagtg ngttnngccg 

ctggctatgt ggacgctggg gcagagccag gccggagtcg aatgatcagc caggaagagt 120 

ttgccaggca gctacagctc tctgatcctc agacggtggc tggtgccttt ggctacttcc 180 

agcaggatac caagggtttg gtggacttcc gagatgtggc ccttgcacta gcagctctgg 240 

atgggggcag gagcctggaa gagctaactc gtctggcctt tgaggtaatg gggggtggcg 3 00 

gtggtggggg gtgcttantg gctatgctca ccccgctnca ttangcctat tttggtctgc 360 

tgtttccaaa tgcttctana tctaggcatt tggtatccaa cctattgcca cantgcctan 420 

aactncanac ccccngccnc tatgntnana cctacttggc acaagaacaa nngnanacnt 480 

tgnnnatatn ccanaangnn naanattaca nantnttata ataccaattn ntnttgangg 540 

tgttnnnnnc anaaacnttt gntnacngnn nnnnntatna atnnataatt nnnnntttgn 600 

nancannanc tatgnnnaat taaangnntn tntncnnnnc nnnacnnnna nnnnnnttan 660 

nnanttncnn ttnnnntnnn nnnnnnnnnt tnaanaannt nnnnnttnat nnnannnncn 720 

nctnnaangt ntntttnnnn nnatnnnnnn nnnncg 75g 
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<210> 567 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 567 



<4UU> so / ■ 

gnntgtnntt nncnnannnn anganagagn tactcgctct ntctctacga tanantgngt 
tnSattcg gcacgagatt tcctccagtc ctgggcccca tccttnaggg ccttcccagc 
cagccagcag gagaggcaag aactggggga acacaggaac ctaggggagg aggggagcgc 
tgggcatcc? caggctggcg gccaagcctg cccctggagg cactagagga ggg=atctgt 
ctgtgggagc ccagagctgc agggaggagg aggagggagg tatctggtgt gagcgttgcc 
cctgcgacat ttgggaccac acaggtgggc ttccttattc cctgacaaag cctctgtttc 
cagctcttcc gccctctctg gatgagggaa cagaagtgga ggaaacaaaa gaagcagcag 
cacgcacagt cctgtcgctg ggtgcggaga cagcctggca aagtcccact cagccatggc 
ctgatgcang ccccagccct nctttcttgg gtgtcaaatg actgtgtcct ggacatctga 
tgcaccacct gccctgcctg ttgcaaacgt gatgctcccg gatggaatgg agaaactagg 
aqactgggac aagcaaaang ctgcaaacaa cccagaaccc attcttagaa nactggagaa 
a?gattgagg aatcattggc accgtggncc tgtgcttcat nacaaacacc tttnagaaca 
acttgggatt gaaaaaccaa gacant 



<210> 568 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<400> 568 



gnnnntngtn gttcttanng ttnggatctc gttctttctn cacgatcncn tcgattcggt 
ctgggcagcc tacgctttcc ggataaaaat ggcagaatga aagaaattat gagtggaact 
agalaatagg aaagacatga accaacgccc aaaatgagaa agaaggacat ataaagaaaa 
aLcaaatac aagtgaaaaa aatagactaa tggattaacg tccctgtcgt gtgacatttt 
ctgctatgga aa^gatatta gacaaaaagc acttcaagtg gttttctf t ttgagttcaa 
Sgggtcat aacgcagcag agataacttg aaacatgaac agcgcatttg g-caggaac 
tactaacgaa catacagggc agctgtgatt caagaagttt tgcaaagcag *^agagcct 
tgaatatgag gaacacagtg gcoagccatt ggatgcttca cttcttgaag ^atcttgaca 
actttttgca ggtgaaatgc ttncacacca gcaggatgca gaaaaatgct ttccaagagt 
SgttgaLn Sgaacatgg atgtttatgc tgcaggaatt aacaaattta "tctcgttg 
gcaaaaaagt gttgattgna atgggtccta tttgattaat aaagatgtgt ttgagcctaa 
aaaaaaaaan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
imnnnnnnnn nnnnnnat 



<210> 569 
<211> 753 
<212> DNA 
<213> Homo sapiens 



<400> 569 ^. fp 
gtttntgant ntgattctta tgcttngnct aatgctttnt ctnnangatc ccnncgattc 
gctggaggag aggagctcag agttctacag agtctttnct gaacaatatc agaaagctgc 
tgaSaStl gaagcaaagt tcaagcgata tgagtctcat ccagtctgtg ctgatctgca 
ggccaaaatt cttcagtgtt accgtgagaa cacccaccag accctcaaat gctccgctct 
ggccacccag tatatgcact gtgtcaatca tgccaaacag agcatgcttg agaagggagg 
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ataaaaactt tcagaatgag caaaacacca tcaacgttaa ttccagagat ggaacatttt 
gag^aSa CCCattt * aa gagaagaccc taatglgaag accSaala 

gagacatcaa gaatggattc agcagaatca tttcacgttt tgaacagcag caatttaaan 

£S£St «2 atCa9g gatCCC ^ tCa "aaaggaca cLttgagtf tll^tlllc 
ctcttatgat gattaaaaga gaagggcagc cctnttcacc tttttggtct ttctatt™, 
cctt^? 30 C f aaaat ^ ttctcttctg nacaaagccc catca^tgg tgScctcac 
ccttaacaaa gtaggattgg ggttgggggg cttaattaat tggaatgggg Saoaaaaa 
gagcccgaaa ccttagatnc cangggnana agt ccaaggagaa 



<210> 570 
<211> 832 
<212> DNA 
<213> Homo sapiens 

<400> 570 



t~" 3 a ssss a c :: t ~ Macc "- t — -» 

M y ggccccaata atctgggntt naaactttqa aaa^t-rr^ a 
a ^-www«u*. c-cagagcgcc tcagttaggg gaacttctct gtaaagaacc ctaaatat^ 
agcaaaaacc ttattatcgt taatgaccta taattggaag cttcctgcct tlllnltl * 
ttgctcctgt ggaaaatact gaaaagatta ctttgStt. ttttgt^gtc tttttataS 
aggggaggtg gagagacccc ttcagagcag ggattgtgcc gggagagtgc ctctaactt" 
llTt^ll atccaca 9 aa atttncaagc caatggtttc tttt^t tgggttttta 
tgtttgnttt ttgggggttt ggaaaaacat gcatttttac cgtgcacgta aStaatca 
nagaaaaagg gagcccagaa aangcagcan atgggccatg cccctttgct gJgttEct 
9g gaCt3tnaa ^ 99S— tggg tttttanaag gtgaagg^tt Stcctgtt* 
SI! 33339 a T 9tCtCt ^'"WSSgg acaanaaggn acccttgggg ^ggtccatt 
cgcaatggtn cctaccaaaa cnnggntctt taanaacacc ngggccttto- Lcla^n- 
aaaaocctgq ercccetfctaa Tia==^t-<-.- "ssstccccg ncccaggnaa 

ttncoaaato ?tthS~i naaaCttt ^ nangggaacc ocggaaaacc cccttgggcc 
ttncoaaatc ttttttccca aagncncccc cggggggccc aaaaaaaaac ct 



<210> 571 
<211> 748 
<212> DNA 
<213> Homo sapiens 



<400> 571 

nn^" 33 ^ ntggacttct aanganttng gctnntcgnt tggaannnnn cagtnctcta 
nnagcccatc gatgcgaatt cggcacgagg ctaggattac aggtgtgagc cac^taccc 
agccacttat ctttaaagga ttaagtttat gtttcctact atgggafacc atcccafc^ 

a r 9cattat ac ?sss 

t"tt?gcct ^gtaattac cctacacaca gcatacacat Ltcftcata 

sS r ? = 3s= sss 2=2= 

S SS= ~: — a sss- ESS 

« SSSS S55 =2£= aaagaccccc ~£ 

gatgtctaaa aaaattaaag ggccttcaac tcataataat acttaatagt tcttaaaata 
ttacaaactg attggaacat tgcctaac tcttaaaata 



<210> 572 
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<211> 755 
<212> DNA 
<213> Homo sapiens 



<400> 572 

agtcttatta nnnngttcta atcctttctt aangagnnta ggctactcgt nctttctgca 
ggtatcccnt gcgatncgaa ttcggcacga ggctgagcac ctttggaaac aacatttaag 
ggaatgtgag cacaatgcat aatgtcttta aaaagcatgt tgtgatgtac acattttgta 
attacctttt ttgttgtttt gtagcaacca tttgtaaaac attccaaata attccacagt 
cctgaagcag caatcgaatc cctttctcac ttttggaagg tgacttttca ccttaatgca 
tattcccctc tccatagagg agaggaaaag gtgtaggcct gccttaccga gagccaaaca 
gagcccaggg agactccgct gtgggaaacc tcattgttct gtacaaagta ctagctaaac 
cagaaaggtg attccaggag gagttagcca aacaacanca aaaacaaaaa Jtgtgctgtt 
caagttttca gctttaagat atctttggat aatgttattt ctatctttat ttttttcatt 
anaagttacc anattaagat ggtaagacct ctgagaccaa aattttgtcc catctctacc 
ccctnacaac tgcttacaga atggatcatg tcccccttat gttgaggtga ccacttaatt 
gctttnctgc ctccttgaaa gaaagaaaag aaagaagact gtgtttttgc cactgattta 
accatgtgaa actcatctna ttaccctttt ctngg 



<210> 573 
<211> 743 
<212> DNA 
<213> Homo sapiens 



<400> 573 

cangtctaat gctggctctn atcggttctt nnnantnaag ntactcgttc tttctncang 
natcnnntgc gntncgctca cacagcatgt gtcagatcca tggggtagga gtcggccaga 
gacttggtaa cagacagatt gctggatccc acccctagac tctctgattc agttagtttg 
gggtaaggcg caagactgaa tttttcacaa gtttcccagt ggtgctgata cttctggtcc 
aSaactLI tggggagaga acgactaatc tagaccattt cacttcacat tctgagcttc 
ttgtcactgt cacactgcat ccttttaaca atgcattccc tatcctattg caatactgac 
atctcatcaa tattttaaaa catgcgtttt cagaaacaat attttatatc aaatactcac 
ttttagtaat atttctgcaa ttttgcccta tggatctgag atctaacaaa tactattctg 
gacatgggct acaacagttg aggctggaag taaaaatgtt aaaccctgct gaccacgtta 
ttttaaagtg tattttagtt aagaataata tggcttagga gcagggctaa acagtagcag 
tcacatgggg aatgatactt tgcttttgca cataaaatgt cctgaaggga aaaaataaag 
cagaaaattn ncagatgaac tgaaaatctg tacaaatgtt gggctgaata ctgccagcgt 
tgangtgtag gaaaatgaac cnt 



<210> 574 
<211> 737 
<2X2> DNA 
<213> Homo sapiens 



<400> 574 

ccgtctaatg ctggnttcta atcgctttct taangctcnn gggctcgntc tcnctncacg 
cagcccggcg gtgcgaattc ggcacgaggg gattacaggc atgacccacc gcgcccagcc 
tgtnatttct tatactntgt attttggnct tgtattatgc ttctgatacg ctataattat 
ttatgtccat gtncntttct tcaatagact gtgaactctt cgaatgtngg actcctagag 
ctagatnctc nattattnnn tattaaattg aatgacttgn aactacagat cctttattta 
aacttcccaa atttctgctt tatctaggcn actctttaaa ttcttttatc tcatgtagat 
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gcnttattat T tttgaa S aaa agagaactgn ggatttaatg 

S""" a ^ ta ^"ttta atggctgttt gggagtnagg ttgcagacat tggtcacttt 
cctcctaaat ncttaaatat ttcctaaaaa caggncattc tttnttttnt tatggagtct 
ggctctggcn tccaggctgg antgccgngg cccatcttgg cttactgcag ctoS 
cgattcncgc tggtctcctg nctngctgct cgggaggc?* aggccnggga atcgttgacc 

SSS ag ™ 9 9 ~ ct ™* ^™ ™ 

737 



<210> 575 
<211> 766 
<212> DNA 

<213> Homo sapiens 
<400> 575 



gnagttnaaa agcggntttt antcotctcn aatcngnttg ggctactnqc tetttcton, 
ggnatcccat cgattcgaat tcggcacgag ctttcLcc? SgtgcScc tgcttccttt 
ctctctcctg cctctcctct gctccccatc ccactttctc atctSeetec 
^cgccagcc cgtaagcttt gataacctgc ttaatactcc aaag^gtgag ttcctctaa't 
ctcttgattc cttagttcta atctcacgtt ttgtttttaa gagatggL? ctctcaclct 
ItllZll 9 ? Ct3ga9t9Ca ^atgat catagctcat tgcatcct? g aaatcctggg 
ctcaggtgat cctnccgcct gagcctcotg agtatctggg actacagatg cgtgccacca 
agcctggcta attttgtctc atgtcttcta aaaattattt tgtgaagccc cttcacaaaa" 
aaccttaang gaaatctgat ggtgctcagg aatctaactc tccctaaacc atcctctttt 
aactgcttct aaaatatctc tggtggcctt tcttagcctt tttctggttc Jttcaatgct 
tcaaagcgct ttttgnttct aagttgagtn ctttgggggt ttgacfggta gtgacgtS 
gttttgacac tgttaacttg ttnaatacag tgaaaangtt tg^gaaglga faaatgcttg 
anaaagaatg gnaatgcctt tntacaaata aaagtnttgt taaaat aaaat * ctt « 



<210> 576 
<211> 761 
<212> DNA 

<213> Homo sapiens 
<400> 576 



Scaca^a g ? gnnttgan cccctttatt attatcaagg ngctngcnct nnctnnannn 
ancacaggcg ntgngaattc ggcacgagaa gataacctct taatgcattc atgttgtata 
tgaaggaaat gagagcaaag gtcgtagctg agtgcacgtt gaaagaaagc gcggccatca 
accagatcct tgggcngagg tggcatgcac tgtccagtag tatttattgc tttfgajatt 
taaaltttt T 9t *T° 9tCCCttttt aaa tatgttt tcct.tttct tgaSgta 
tttSS^ ttccccctta gcataagcat cttatatata acaactcatt tgtacaaggt 
ttc^Sr 3tatataaaa tgtgf f f tatttttgnt tccccttttt gacttttSt 
ttctgtatga aacccagatg tcaccaaatg gacattaata gttgcattaa ggatcagtaq 
aatttt^co TTIT aaa9CCatta tgtaaaacaa Jacttgaaaa 
tlttllT 9 aCaCtgtCtg ^acagtgg gaaccatctt cgtttcccct ttgaactccf 
"cgScaa aa^S* 9 * a ~ccggac tggcccgngt acaaaacttt 

^2^^ aattcttaag tgaatttacc tttctncctc tttttgaagc tngaaatttt 
tggtcatcan gntttgcttg tgatngtaca tanggtngaa n 

<210> 577 
<211> 803 
<212> DNA 



420 
480 
540 
600 
660 



60 
120 

1 fln 

240 
300 
360 
420 
480 
540 
600 
660 
720 
766 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
761 



165 



WO 99/33982 



PCT/US98/27610 



<213> Homo sapiens 



<400> 577 

gggtngttnn nnngtggnnt tnttnnnngt ttctaatnnt cgnggngntc ganctnnctc 
nananagaat aggtttgnga attcggcacg aggtctcccg cccggcgccc ccagtgtttt 
ctgagggcgg aaatggccaa ttcgggcctg cagttgctgg gcttctccat ggccctgctg 
ggctgggtgg ggtctggtgg cctgcaccgn catcccgcag tggcagatga gctcctatgc 
gggtgacaac atcatcacgg tccagccatg tacaangggc tgtggatgga ctgcgtcacg 
cagcctctag aactatagtg agtcgtatta cgtagatcca gacatgataa gatcattgat 
gagtttggac aaaccacaac tagaatgcag tgaaaaaaat gctttatttg tgaaatttgt 
gatgctattg ctttatttgt aaccattata agctgcaata aacaaagtta acaacaacaa 
ttgcattcat tttatgttca agttcagggg gaggtgttgg aggtttttta aatnnncggc 
cncngcgcca atgcattggg ccccgtaccc acttttggtn cctttaantg aagggtttaa 
tttgocccnc tntgccgtaa ttcatgggnc atanncttgn tttcctggng ttgaaaattg 
gntaatcccc ttcnacaaat ttccnccaca atcatttacc aaaccccngg gaggcctttn 
aaagnngtna aaanccctgg gggtggccct taatttaagt ggnnccttaa ctcncnttta 
antgccnttg cccttcactg cct 



<210> 578 
<211> 738 
<212> DNA 
<213> Homo sapiens 



<400> 578 

tcgtcccntn gatcggggta acgtccttnc ctatnaaant tctttcggga aagcagaaac 
caagctggca gaagcacaga tagaagagct ntcgtcagaa aacacaggag gaaggggagg 
agcgggctga gtcggagcag gaggcctacc tgcgtgagga ttgagggcct gagcacactg 
ccctgtctcc ccactcagtg gggaaagcag gggcagatgc caccctgccc agggttggca 
tgactgtctg tgcaccgaga agaggcggca gatcctgccc tggccaatca ggcgagacgc 
ctttgtgagc tgtgagtgcc tcctgtggtc tcaggcttgc gctggacctg gttcttagcc 
cttgggcact gcaccctgtt taacatttca ccccaotctg tacagctgct cttacccatt 
ttttttacct cacacccaaa gcattttgcc tacctgggtc agagagagga gtcctttttg 
tcatgccctt aagttcagca actgtttaac ctgttttcag tcttatttac gtcgtcaaaa 
atgatttagt acttgttccc tctgttggga tgccagttgt ggcaagggga ggggaacctg 
tccagtttgt accatttctt tgnatgtatt tctgatgtgn tctottgatc tgcccccact 
gtcctgtgaa ggacagctna ngncaaggag tgaaaaactt tacttcttaa aaaaaaaaan 
nnnnnnnnnn nnnnnnaa 



<210> 579 
<211> 758 
<212> DNA 
<213> Homo sapiens 



<400> 579 

gnngtgncta nctaaatnnt tggntntaaa cgtncnttct gcatnatccc tnnttgacga 
attnggcacg agacagagtc ctgaaatatg caaatgaagt aaattctgat gctggcgcct 
tcaagaaaca gcctaaagga cctgcctgat gtgcaagagc tcatcactca agtgcggtca 
gagaagtgct ccctgcaggc cgaagccatc cttgatgcaa acgacgctca tcaaacagag 
accttctcct cccaagtcaa agggacaaat aagcctctgg gttgaacggt ttgagacatt 
ctgccttgga ccttcccttg tcaccaaaca agccaacctt gtgcacttcc accaggcttt 
cacccattcc ctgcaagcct tggtcttttg acctggccct caaccatgtg gctttccacc 
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ccjp* ™- =s c = . 

-5 SEE £££ — = - 



<210> 580 
<211> 816 
<212> DNA 
<213> Homo sapiens 

<400> 580 



tttctaaatn gcttgggttt cnaaatccct tggttgacgc cctcacctaa nnf^^ 
nantgcccnc gattcgctgn caagtctgga antcatat^g gagcct^t n^actaaaS 

==? 2=22 X£= e~ 

™p sstje ssss sss: ™: 

2=2= =222 2S2S sss — : - 

ses 22222 22222 22222 22222 
22222 r ct95 " — »<- s„ ? g = t «^t 22222 22222 

22222 22222 ..aaaetns* g cc=tn2 g » ,c»222t 

22222 22222 2™ 3 9atnca " 99 «"» 



<210> 581 
<211> 868 
<212> DNA 
<213> Homo sapiens 

<400> 581 
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tcnaannctg ctnacgccca nagcatgacc cacgatcgaa tcggcacgag gttqcaaaca 
gccttggaat agtaactctt ctcatttgtt tgggatctgg ccaccaagtn ccagaltaat 
acacggatca gngcanaagn tcatcaggct ctcggaccS agggctg^g gajaagcftc 
agcagcagaa ctgatggtga aggctcgtgt tctccatcct caactt^t? ^cttcgatc 
atacacaaga atacattngg aagggcaaaa aaatgaacac tgtcgnncat tgcagccc^ 
gtttngtgac acagatgcac agtctgcttg tgaagacctt ctctLagtg gcatttaaS 
gtccatgcca gancatggtg cttcatgaga gactgacagc tatcagggg? tS^cfctt 
° tCCtCCCCC -gtgtgtgct gatgacacat cacacctgac aSgctnga 
Saatatc! gaC ° Cntntt ^tctgfgc caacatacca catgant^ta aaacccnttc 
gg'ttcc^ tSt^ac?! f CataCntg 9CCCaaat ^ cagaantcna gagcctaat^ 

Ed-F 22=2 = s™ s~ 

a c c™ g : ™r c — a -™ - 

868 

<210> 582 
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<211> 745 
<212> DNA 
<213> Homo sapiens 

<400> 582 

ttctgaatac cttnttacnc gccttcttca ggantttcaa gacctaattc ggcacgagac 60 

cctttctgcc ttctgtttgg gacccagctg gtgttctttg gtttgctttc ttcaggctct 120 

agggctgtgc tatccaatac agtaaccaca tgcggctgtt taaagttaag ccaattaaaa 180 

tcacataaga ttaaaaattc cttcctcagt tgcactaacc acgtttctag aggcgtcact 240 

gtatgtagtt catggctact gtactgacag cgagagcatg tccatctgtt ggacagcact 300 

attctagaga actaaactgg cttaacgagt cacagcctca gctgtgctgg gacgaccctt 360 

gtctccctgg gtaggagggg ggggaatggg ggaagggctg atgagacccc agctggggcc 42 0 

tgttgtctgg gacccttcct ctnctganaa gggaggcctg gtggcttaac ctgggcangt 480 

cnngtcttct ctgaccccan tggctgcngt gaaggggaac cacccttcct tgcttgacca 54 0 

ntggccatta nctnccntna ccacttgnaa cccanggtcc canctggctg ggaccctntt 600 

ntncccccaa ngncctttcc cttgggctnt nttggantga gcaccttctn tgtnngcacc 660 

ttttanaant gnnnnnntgn tactgatttt tttgntaaaa agannttaaa anctggnant 720 

ttntnaaaaa aaannannaa aannn 745 

<210> 583 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<400> 583 

gnttctaatn cttggcctac tcgcctntct ncaggatctt atcgatncna attcggcacg 60 

agatatggta tagttggaaa taggttattg tgagttattt gtagtcatgt ctttaatggc 120 

ccttgcatgg tgtctaactt ctgcaataaa tgatctgcca gtcctagtgt ctgggcttta 180 

tgcaatttgt tttcctttgt ggatgaagtg ggagtaagac ttgttgctgt gaggatcaga 240 

tgaagtggct aggatatgga cacactttac ttgaattgga aaacaagcca tgtatcccta 300 

atctgcaaaa tgtggcatgt cacacgtgta atctctgagg tttagttttt gctcaagatt 360 

gcaaaggtga cttgcttgat gctttctttg cttgagcaca catctcattc attaaatggt 420 

gtctcctttt ttgcacacag gatgcagaac ataattgacc ttttccaagt ctacttagca 480 

gaaatgaaaa tggaatcata taaatacagt attatacttt aaaataaaaa ggctgtacaa 540 

aagtttggct gacatagctt gcttctagta atctgaatgg cttatttaaa taaagttgga 600 

tctatggact cttcacagnc tagatattat cctactggaa gatgtgcctc gaaagctgtt 660 

gaaccacngc aaaaaaaccc ttcagtcagc acgtgagaaa acctgcgagc ccacatttcc 720 

cccgggacca ttctgaacat cctactgg 748 

<210> 584 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<400> 584 

tttaatgctt gttacacgcc ttctgcagga tttatcgatt cnaattcggc acgaggctat 60 

gtattgtgtc ctaccatgaa ttcactccat gctagccaca ttggcctgta tggctattcc 120 

ttggacacac ctaggatgtt cttgcctctt agcttgccta cctttctctc atcatttggg 180 

cctcancgag gatatcatct cctcagagaa gccttctgtg accatgctat ctaaaatact 240 

ccagcacttc agtcaccctt tatcccatta ctctgctttt tcagaaacat tggtgctccc 300 

tgaaacatat ttgtttactt gcttagtgtc ttttctcccg cactaccatg taagcttctt 360 
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gagggttaag ggaccttgtt agggataacc actgtatcct tagagtgtga cacatagtag 420 

gttctcaata catatttttg aaactctacc ctgatgcaaa agagatatca aataattata 480 

gtttttgcat tataaatggc tttggtgaaa tccctggcac aaaactaata ataaaagaaa 540 

taaacagata atgttgaagt tctgggcctg caaaacctaa ctcttttaaa gcagtcccag 600 

taaatgtgtc attgggatcc ataagacttt gtgggaaagt caacataatt ttattnggga 660 

aaaagcattg aaccttcaaa agtnaaaact ttatnggncc aaaatctcaa ttactggggg 720 

gccgttcttt aagtcatttt aaaccctttg angccnacag ttttacacca aat 773 



<210> 585 
<211> 745 
<212> DNA 
<213> Homo sapiens 



<210> 586 
<211> 749 
<212> DNA 
<213> Homo sapiens 



<210> 587 
<211> 783 
<212> DNA 



<400> 585 

ttcaatacnt ntttcnngcc ttttgcagga tcnctcgatt cgatggaaca tgagtggaag 60 

tgggcagtct ttttctttcc ctatcagctg agtgaatgaa gatttagagg gcagcagagt 12 0 

catgacatgg atgacgttgg gtctctggat ggctaaatgg aagacccgcc ccccaacgcc 180 

dctctacccc cctgctttga actatgcttt gagaaatgag cttatgagac cactgagact 240 

tgggggctgt ttgttcagca gttcacctac acttattagg aaaggttgac ttcttgtact 300 

acgcctttcc ttaaatcatc ttttgtataa ttctcagaac actgctggtt tgggtggtct 360 

cacacatttc tcacatccaa attttaaaga tttcatgaat gttcattaca gtggatttat 420 

ttttctcttt ctgcttctcg gcatgccctc tcaatttggg agaaatctct aattggatga 480 

ctttggtggg accacaggag tgtaaggatc gtaattccct cacttcatcc cctgcaaatt 540 

aaagcctggg cacttaagac tcactcaact gaatcttgat atgtgggact ttanatctta 600 

agcaaatgan gcaaaagaag gaaaagacag ttgagaaaat caatctctga agttcagcac 660 

ttgatttcca ccgtggaccg gactcctgca nctttgcatt ngccttggtt cctggccatt 720 
ttccnaaccc gggttccttt ttgan 



745 



<400> 586 

tgttctaata ctaggtntac tcgccttttg caggatctna tcgattcnaa ttcggcacga 60 

ggggtcctgg tgggagtnnc atncagcagn ganngcattc tttccncaca ncagtnaacg 120 

gtcttattaa nagccaccac tttnctgang cctgtacagg ccttgnnggt tnggngaaca 180 

gaaatnncgc aggcacttgt accttcaagn anggacttgt gcctnactgn nagggttggc 24 0 

gttgaccttg gctcnacnga catacccant ctgacttnna acgngcncgt ctnagcttac 300 

gctagactgc acnnccaagn ttgcangcct nttntgnctt ccctgcattn accaatgaca 360 

gtacgaccaa cagtcaanga aaagtgccaa gatatatcta tcccatttct tctacacctg 420 

tanattcctn actatgctca aactatgtgg ngcaangaan actggnngac atttttagtc 480 

aatgatgctg acaattaatt actggtgngg ccaggcatat nttcacggct gcttgtgatg 540 

ccaacnaaga acgggcccca gcccatcctt actcctngnc cccaaanaga tccagtgnga 600 

atgggaagct gnnannacca acccaactnn tgatttacca ccaacnccaa anatcacgca 660 

tgnnnacagc aaaacaacaa cncnatgcac ttaacaagna nccnaaaant naactcgngc 720 
ctctaaaact attngggant cctttanct 
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<213> Homo sapxens 
<400> 587 

gttctaatnc ttggcctact cgcctntctg caggatcttn tcgaccttat tcggcacgag 
cccaaggcaa gctgttaaca aaatcaacct gggccaatca tcaaagggtt ggacctaagg 
ttgctatact caatagaaca agcattttaa ataaatttct cgtaagttgt tgctttcttt 
atgtggtggg tgtggcttta aagagcacaa aaccacaaca aatcaaagag tagctcgggc 
ttgtcttttg ctttatggct gagggtttga aggatgattc atggacttgt gaatgccagc 
cccagtcccg gcttaggtct atctgccaat accaccaggg ccaacaaatt cacgcaacaa 
attctctcat tttttacagt ttatcagttg cactcatagt tattgtcata atcactcccc 
acagtaacct gtaaggcata taaagtagct attttagtaa gataaatgat attttatata 
tgttatgata agataaatct tatcatttta agaagaaact gagctcggag agatgaaatg 
acttcctcag ttgctgctgt aataaaagtc tactttttgc taaaaaaaaa aaannnaaat 
atnntntann attnnantaa naanaaaaac ttcgagccnt tttnaaactt tnantggagt 
cnntttntcc cgtaaaatcc nnnactttgg atnaanannc catttngatn aagtttttgg 
gacaaacccc ccaacttaga aattgcnntn ggaaaaaaaa ntgcntttta ttttgnggaa 



aan 



<210> 588 
<211> 771 
<212> DNA 
<213> Homo sapiens 



<400> 588 

tcttctaatg ctggttacan gccttctgcn gatccctcga ttcgaattcg gcacgagata 
ctttttaaac cttttttggc agctcagatg gtgtaaattt taaaattttg tataggtatt 
tcataacaaa aatatgtatt tcttttttgt tattttatct tgaaaacggt acatatttta 
gtatttgtgc agaaaaacaa gtcctaaagt atttgttttt atttgtacca tccacttgtg 
ccttactgta tcctgtgtca tgtccaatca gttgtaaaca atggcatctt tgaacagtgt 
gatgagaata ggaatgtggt gttttaaagc agtgttgcat tttaatcagt aatctacctg 
gtggatttgt ttttaaccaa aaagatgaat tatcaatgat ttgtaattat atcggttgat 
tntttttgaa aagatgaacc aaaggatttg actgctaata ttttattcct tacacttttt 
tctgaataag tctctcataa tgagtgcagt gtcagactgt gcctactctg atggtatngt 
gccatttgta aaatnanaat aagagcagaa aaaacacaaa nangagaaca ctggnttcag 
acattcantg gggcaagtta aattatggga ctgcaaaaat aatggatttt ttattcaaag 
aaaagcttta aaaagtttta ttatccanat ttacaaccca ctanttaagc taaataancc 
tactttnaaa aatngnaaat ggttnctatc tttataangt gccaanttna n 



<210> 589 
<211> 844 
<212> DNA 
<213> Homo sapiens 



<400> 589 

tncactnnaa tccttntnta aaaagccttc tgcntgatcc catcgattcg aattcggcac 
gaggccagag cctagaggag agatcaaaga cnttngccga agtgaagccc attctgcaag 
caactgggtt cccatggcat gtggtggcct tagaggaggt gttcagcctg ccaccgtcgg 
tgctttggtg ctctgcccag gagctggtgg gatccgaggg ggcctacaag gcggccgtgg 
acagcttcct ccagcagcag catgtgctgg gggccggggg tggtcctggc ccgactcaag 
gggaggaaca gccaccccag cccccgctgg acccccagaa cctggcaaga ccgcctgccc 
ctgcccagac tgaggctctt tcccaactgt tctgctcaat gaggacactg actgccaagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
783 
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aggagcttct gcagaccctg cggacccacc tgatcctnca cgtggcccga gcccacggct 480 

actccaaggt catgactggg gacagntgca cacgcttggc tatcaagctc atgaccaacc 54 0 

tgcnctgggt ccaaagggcc ttcctggcct gggatacngg ctttcttgga tgaaccngna 600 

ccgggngaac gtnggtggtn ggtgccggnn cattgcctgg gaaccaccac ccccttnaaa 660 

angaangntc gnttatttct aacaaaaccc ggnccttgnt tcntaccntn ttccctntct 720 

tggnnnnttt tnaanacncn annncccaat tngnaanaac ccnaaaangg gnccctttgn 780 

aaaaaaangg ggccnatatn ntntntcana cccgnggnct ttgaatnngg aaaangccnc 840 
tnct 



<210> 591 
<211> 765 
<212> DNA 
<213> Homo sapiens 



<210> 592 
<211> 757 



844 



<210> 590 
<211> 767 
<212> DNA 
<213> Homo sapiens 

<400> 590 

tctaatgctt ggntctngcc ttttgcggat ctttcgattc gnattcggca cgagagaacg 60 

ttctcaggtt gaccagctgc tgaatatttc tttaagggag gaagaactta gtanntcatt 12 0 

auaabgcatg gataacaatc ctctgcaagc ccgtgcagcc cttcagacag cttatgtgga 180 

agttcagagg ctacttatgc tcaagcagca gataactatg gagatgagtg cactgaggac 240 

ccatagaata cagattctac agggattaca agaaacatat gaaccttctg agcacccagg 300 

tttggcatag aaatggtacc ccttgttcaa aatgaacaag aagccttaga tttggatggg 360 

gaacctgatc tgtccagtct agaaggattc cagtgggaag gtgtttccat ttcctcgtcc 42 0 

cctggcttgg caagaaagcg aagcctttct gagagcagcg tgatcatgga cagagctcct 480 

tctgtgtata gcttcttcag tgaggaangt acaggcaaag aaaatgagcc ccagcagatg 540 

gtttcaccta gtaactcatt ganggctgga cagaaccaga aagcaaccat gcaccctcaa 600 

acaaggaagt nacacctcng gctggccttc ccttccgaac aggtgaaagg ggcttgaaaa 660 

atgttgctac cccaaaggcg acattnntgg caccaaatta tccctcttga ccnntttaat 720 
accttttgat tncatttngg caaaagactt tgnaccagcc nnggaga 



767 



<400> 591 

tctttgaatc cttttgtaaa agccttttgc atgatccctc gattcgaatt cggcacgaga 60 

cttcttgttt gcctttttta taaggaaatg ttggagagtt acatcattgc taatgtagaa 120 

atgttaagtg gaaaaatata cagtttggta aaataaacta gattctacat ttatttgtgg 180 

gtttttttcc cctcctttct ttccacagca cttttgatat caagcaagtg gcttcctttt 240 

tgagatatta aaaaaaaaaa gaaaaggaaa aaagtaaatg aagcccaact acctaaccct 300 

ttcttatttg tatttgtttt agtattgtga agttgtgtta aatagtacta gctagaaata 360 

caaatttctg gttatcattt ctcttccctg tggcacttga cattttaatt gtcttaaagt 420 

ttttgaagtc atcttctggc cccttgagta ctgccagagg caaaagatgt ttgtttctta 480 

ttcattccac ttttgtctcc tgggatccct tctgtagcct aaagtatggc tgggaaatgg 540 

acttgagaag attggcttga attangatca taatcatgtg tgatcccatc atgaattcat 600 

tggaatntgg ggtncatgta angcaatcnt tctggtgtaa atcttccttt ttttaatgna 660 

catatanttt tggaaaaaat tttgaattaa ccctgaaaat ttttaaaaaa gccctcttan 720 
aactattann ggaggtcnca ttaccctaga atccanacat tnant 
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<212> DNA 

<213> Homo sapiens 



<400> 
tnttcnaana 
gattgagttc 
ctgcaggcgt 
gccccggccc 
tgtacacgcc 
acccaactgg 
aggcgcgttt 
gggcctcctc 
gctcttgagg 
gttttaccag 
ttatttcttt 
tatttacctt 
ttggaatgca 



592 
ctngttctng 
acagtgtatg 
gcacactggg 
ctatacccca 
tncaaagtgg 
gggcaaaggt 
gtgcatatct 
agtgaaggtg 
aacccaggct 
ctcccaaggt 
ttaaaaaaaa 
agatncagac 
ntgaaaaaaa 



ncttttgcag 
ttctgggggc 
gtggacgatg 
gtgttctctt 
agctgggagg 
ttgaagacac 
gcctcccctg 
gtggggctgc 
cggaggaccc 
ggacttcant 
aaaaaaaaaa 
atgataagaa 
atgctttatt 



gatcccatcg 
gctggtgcag 
ggtggccccg 
tgatccagtc 
tagaagggga 
agcctccccc 
tctctaagga 
cggatctggg 
tggaaaacag 
gtgtgtattt 
actcgancct 
tncattgatg 
tgtgnat 



attcgccaaa 
tcagcggtcc 
caggtgtaca 
ccgaaacaga 
ggacactggt 
gccagcccca 
gcagcgggaa 
ctgtggggcc 
acgggtctga 
gtgtaaatga 
ntanaactat 
aattttggac 



tctgcctaga 
agtctccagc 
catttgggtg 
gggagccttg 
ggttctactg 
agctgggccg 
cggagcttcg 
cttgggccac 
gactgaaatt 
gtaaaacatt 
tagtgagtcc 
aaaccacaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
757 



<210> 593 
<211> 766 
<212> DNA 
<213> Homo sapiens 



<400> 
tcttgaatnc 
attggtgtgt 
agaaacatcc 
gtaaggcatt 
gtcacagatg 
tggtgaaggt 
cggctccagc 
ctagggggta 
aagttcagct 
gggaatgttt 
tcttatttgg 
ngtgtgttgc 
aaaatantnt 



593 
tngttnntgc 
gagtgttttt 
agaatgctcc 
gcctgctctt 
ccctgtctat 
ggtgagccct 
ttctcactgt 
gagatttatg 
cttagaaaat 
ccaactagat 
ntttgtnaaa 
cccntntgcc 
anntnanant 



ctttttcgga 
tgatggtgca 
tccccatccc 
gaccctttgg 
gcagagggtg 
tctgaggccc 
aaagttgtca 
ctaatgagtg 
ggaatcttac 
ctccttcaga 
caaaagcttt 
gtttcaaata 
nntntancnt 



tccctcgatt 
ggacccggag 
ccaatcccag 
cccatctttt 
gcgtgggatg 
tgatagaact 
tcctggcaga 
aatattgcac 
ctgaccccta 
agagtccctg 
tgggtctcgt 
aaaggtttgg 
ttnttnncnt 



cgaattcggc 
gtgctttcct 
acagcaatta 
tatttttaaa 
ggtgaccact 
ttccaggagt 
ggcagccaat 
cactagtgac 
gtgaattatg 
tgctggaata 

ggggtgtgtg 

taccaccttt 
tanant 



acgagagaac 
tgccaagaat 
tgtcagccct 
aaattcccat 
aagtttaggc 
tcatggtccg 
gcttttcatt 
tttctgttta 
tacataagca 
ggtcactgaa 
tgtgntttgg 
tcaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
766 



<210> 594 
<211> 754 
<212> DNA 
<213> Homo sapiens 



<400> 
ttgnttagga 
aagtctcaaa 
gcgcgaagtc 
tttttccttc 
taacaatgat 
ttctgacatc 
cgtattacgt 



594 
tcccatcgat 
ctccagaagc 
ctcgtccaga 
ccccactttt 
attttgtgta 
ccccaaaaaa 
agatccagac 



tcgaattcgg 
cccacctccc 
gaaggcaacg 
taaaagtttg 
gaattctggg 
aaaaanaaaa 
atgataagat 



cacgagggaa 
tgagctcagc 
gcgagaaaca 
gtgtccaagt 
atcaaaatat 
aaaactcgag 
acattgatga 



ggcagtggga 
tcctctgcca 
aatccaacat 
cacttgacaa 
aatttcaaaa 
cctctagaac 
gtttggacaa 



ggagaggacc 
agccccctca 
cctgggctgc 
acccagaccc 
ataatatatt 
tatagtgagt 
accacaacta 



60 
120 
180 
240 
300 
360 
420 



172 



BNSDOCID: <WO 9933982A2_I_> 



WO 99/33982 



PCT/US98/27610 



gaatgcantg aaaaaaatgc tttatttgtg aaattttgtg atgctattgc tttatttgna 480 

accattataa agctgcaata aacaagttaa caacaacaat tgcattcatt ttatgtttca 54 0 

aggttcangg ggaggtntgg gangtttttt taattcngcn ggcgcngcnc caatgcattg 600 

gggccccggt nncccanctt ttggntccct ttaagnngan gggtaaantg ncgcncttgg 660 

cntaatcttt gnncatnggt tggnttnctg nggngnaaat tggttttccn ggnnanaatt 720 

tccccncatn ttangatccc nggnngntnt aang 754 

<210> 595 
<211> 767 
<212> DNA 
<213> Homo sapiens 

<400> 595 

ggtttaatgc tgttnnaanc cttcttnanc ctttgtacag catccctcga ttcgaattcg 60 

gcacgaggaa cgcttccatt ttatacctgt gtctagttag tttctgccta tctatccaag 120 

aagcttttat caagggtcca ccatgtgcca gccactgaag tagatataaa tacaaggatg 180 

tgtaaggtat ggatgatggt atacgaactg tcatcttact ggatttgtcc gctctgttaa 240 

agatacggtt ccgaaaactt tttaaagccc tagagagggc tttaaggcaa tgtagcatca 300 

tatatagagg catnaacctg ttcatatctt tctatttaac agaactgtgc acctgggcac 360 

aagggtgtgc acaacaggat gtgtacagca gcactgttaa agtgtancac atccatacta 420 

cangatctta tgcaactgtt ggaaagaatg aagcgatgct gcactgtggt catgcagtga 480 

tctctaagac atattaactc gaaagcaaaa ggtttaacaa tgtatnacaa actgggctgc 540 

aattgactcg cgcctgtaat cccagcnctt tgggaggctt gantaaggcg gatcacctga 600 

ngtcangagt ttgagaccaa acctggccaa tgttggccna aaccnctgct tctactnaaa 660 

ctacnaaaaa ttaacctngg gcntggttgg ctccgtgcct tntaatcccn gcttactcgg 72 0 

caatgcttga gngaangnan aattngcttt gaacctnggg gaggnng 767 

<210> 596 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 596 

tnttnaatnc tnttttaatn cttgctgcan gatctttcga tgatcccatc gattcnctgg 60 

tctcgaacac ctgacctcag gtgatccatt cgncttggcc tctcgaagtg ttgggattcc 120 

aggcgtgagc cactgcggcc agcacatttc cacttntaga tcctactcca taccacaggt 180 

ttcatttaag angaaaganc tanataaatg tgctcttntg gataccccac cctgacagan 240 

tgcattgtta cacagntanc atgggttgac actgcaanct ggcctgtcag ccatnggagg 300 

ngtttannga aaggcanatn atgtnactct gtgncagggn gccatntgct tacccntnac 360 

ctagcatang gggnttctac gggtgacccc nagcatattt ctaggttact tatgggcaga 420 

tttgtaagtg acaaaactcc agctgatgct gggaatgggg agagggccct tganggactt 480 

tgtggntttg tgcttctggt ttcctggcca accccagggt cacttgtctg gagcccagct 540 

gggcactaat gtctgccanc gactatntta cagtgtataa atgattcctc tatttgggga 600 

gagatcttcc aatccagaag agcccctntt ggactgcctg ggttaaatct gcatagcana 660 

agtggttgat gagtcatctg aagaaattca gccccaactt nncaacctgc ccttcctgnt 720 
tcctttttaa tggnggcctn tgg 



<210> 597 
<211> 786 
<212> DNA 
<213> Homo sapiens 
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<400> 
ngtttnnncc 
aattcggcac 
nanaggcaga 
gagaatttna 
gtggatgtgg 
gctccgatgg 
ccagcctaag 
ntgantggnn 
cacaatccan 
tgtggtnaaa 
acatcacatn 
ngaacncata 
taanttttgn 
ccagnt 



597 
ngtttttaat 
gaggacanac 
ngtggccctg 
aagatatggt 
agtttggnac 
aggaacccac 
ttnctgactg 
aaatctttca 
tgnnccttac 
gatgcctatn 
tcctnttgga 
agaatgcacc 
nnccccaanc 



ncnttgctac 
cgttgagagg 
ggaacagagt 
gggaatgaca 
ctngggatcg 
agtggtggag 
gatgtcaaga 
gatcanncca 
taagccgtgg 
atgaccnatt 
gattaaattt 
cccnggntag 
attaaantgg 



tngctctttt 
acgtggaggc 
tttatgacnc 
aaatagcagn 
ngnaaagata 
gagttccant 
agaaacccaa 
gaccatancn 
tgattnacaa 
ctccatngtt 
tnggtnancn 
gcccngtnna 
ctngattccc 



tgcaggatcc 
ccnttagggg 
ttttnaccat 
cataactgaa 
ccagtgatgt 
gcanccngga 
nttataanag 
tgagtttaac 
gtcataaatc 
ntccngaaac 
tnccttcgtc 
aaggttnatg 
ttaatntttt 



catcgattcg 
gtntgcncng 
anangaangn 
gacaacatgg 
ggagccaact 
agaggagtat 
atgactntan 
atccgnaanc 
cattanatga 
ccgtcaattg 
cttgggcatt 
aaggccntta 
cctcccnaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
786 



<210> 598 
<211> 809 
<212> DNA 
<213> Homo sapiens 



<400> 
ngttttnnnn 
gaattcggca 
gcagaggcag 
gagaatttga 
gtggatgtgg 
caccgatgga 
cagccaaagt 
atgagtggaa 
cagaatccac 
ggtaaaaaga 
acattccaac 
ccntgaagga 
cantaanaaa 
atggtttgnc 



598 
cnnnttttct 
cgaggacaga 
aggtggccct 
aagatatggt 
agttcgaacc 
ggaacccaca 
tcctgagttg 
aaatcttcaa 
tgttcttcca 
tgactatgag 
ttggagatta 
aggcaccccn 
ccttttggga 
cccccnaaac 



aatgcttgct 
ccgttgagag 
gggaacagag 
gggaatgaca 
tgnggatcgg 
gtggtggagg 
gaggtagaag 
gaacagacca 
aagccgtggt 
gaccattccc 
aattttgggt 
ggtgaagggc 
ccccccttaa 
cccnaaant 



tctcgttcct 
gacgtggagg 
tttttgacgc 
aaatagcagt 
gtaaagatac 
agtcccaggg 
aagaaaccca 
gaccaaagcc 
gattcacaag 
atgttttccg 
aacccttcct 
ccngggggaa 
nccaataaaa 



ttgcaggatc 
cccgagaggg 
ttttgaccag 
cagaactgaa 
cagtgatgtg 
caccccggaa 
agttcaagag 
tgagtttaac 
tcaaaataca 
gaaaaccccc 
ttgtncttgg 
agggattcan 
tttggtngaa 



ccatcgattc 
ggtatncncg 
agaggaaagc 
gacaacatgg 
gagccaactg 
gaggagtctc 
atgactttag 
atccggaaac 
tagatgatat 
cattgacatc 
gccttngaac 
ggnaaggggc 
ttgcnangga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
809 



<210> 599 
<211> 759 
<212> DNA 
<213> Homo sapiens 



<400> 
tttntaatnc 
tcggcacgag 
gaaacttcaa 
tcttctaaat 
agtacggtgc 
atatcacctt 
accaagtgtt 
atggagcttc 



599 
tttttcnaat 
ccaggttagc 
gcaaattgga 
ctactccaga 
cacttgcctc 
ctgggttatt 
cagaatttgc 
tatgacagtg 



gctngcttca 
tgctgaatca 
aaaagaaact 
gacccagtgc 
aaaagttact 
tactcattgt 
ttgactctaa 
aataaactat 



ggannntntg 
aagcttcaaa 
tgtaagaaat 
cctcaacagt 
atggtgctta 
gccaggacct 
cctggagagc 
taattgaagg 



cangatccct 
cagaagttaa 
cacaccctat 
aaagactttt 
agattgtctt 
ggcattttca 
ttcttaagtg 
aaaatgttat 



cgattcgaat 
agaaggaaaa 
tctatatgtg 
ctttaataag 
gatctgacat 
tgtgcctttg 
atgccccttc 
aattaatgta 



60 
120 
180 
240 
300 
360 
420 
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<210> 600 
<211> 769 
<212> DNA 

<213> Homo sapiens 



<210> 601 
<211> 755 
<212> DNA 
<213> Homo sapiens 



<210> 602 
<211> 773 
<212> DNA 
<213> Homo sapiens 



tctatttgct gcattgtata tggattaaat gataaaaaac aagtaatcta ccctcagagc 540 

catgtatttg agaatgcttc aatcatattt tcctatgtac ctttttttta taaacttagt 600 

tttagactat gttgtaaaaa tggggaaagg ttgtaaacta tgtngtaaaa aatngggaaa 660 

tgtggcttta aaatatatnc attatatttg gttcaaggat tttggcaggg gntaaaggaa 720 
ncnatggttc aatctttgna tttatatacc ntgatttaa 



759 



<400> 600 

ttttaatacn tttttnaatn cttgcttncg ntcctttgca ggatcccatc gattcgaatt 
cggcacgaga gcaattccac tcctagctcc acccacaggt aattgaaagc aaagacgcaa 
acagatgcct gtgcaccaaa gttcacggca gcatccttcg ccatagtggc agcatccgtc 
gtcacagcgg natcatcctt catcatagcg gcagcatccg tcgtcacagc ggcagcatcc 240 
ttcgccacag cggcagcatc tgtcgtcaca gnggcagcat ccttcacca* ac ™^ a ~ ca 
tccttcgtca tagcggcagc atcctttgcc atagcggcaa ggtggaaacc c^gtccatcc 
actgaggcgt gcatagacta aacatggcca gtccaggcac tggaatccag gccgtanaac 
ggngcccacn gtcaaaagga atgagaccct gatgcactgg gcgacacaga cgggcgacac 
agacttggag acatcatgct aagtgaaaag ccaggcacac ggagcggacg gggtgatcct 
gctcacgtga tgtgtcccga atgggcacnt tcagagggga agaanggaga tggcgcttga 600 
cngtgnccgg gacnggggtt gggagcgacc ggttgttggt ttngggtttc tttctngggt 660 
gaaggaaatg tttttgatat tggggccgtt tgggtgatnt ttgcattacc ctttgaatat 
gcttanaacc cnctagaaat tgnnacactt tttaaatngn ttggaaatt 



60 
120 
180 



300 
360 
420 
480 
540 



720 
769 



<400> 601 

ntgtttaata ctattttcta atacttgctt tcgttctntt tgcangatcc catcgattcn 60 
aattcggcac gaggagacag cagcccccag ggaatgaagc tgatgccaga gtcagacccg 120 
aggaggaaga ggagccactg atggagatgc ggctccggga tgcgcctcag cacttntatg 180 
cagcaactgc tgcagctggn cctcaagtac ctctttatcc ttggtattca gattctggcc 240 
tgtgccttgg cannctncat ccttngnagg catctcatgg tctggaaagt gtttgcccct 300 
aagttcatat ttgangctgt gggcttcatt gnnagcancg nnggacttnt nctgggcata 360 
gctttggtga tnagagtgga tggtgctgtn anctnctggt tcangcanct atttctggcc 420 
agcagatgta nnctatatct gtgattactg gcacttggct acagagagtg ctggataaca 480 
gtgtagcctg cctgtacagg tactggatga tctgnaanac aggctcagcn atactcttac 
tatcatgcaa ccaggggccg gttgacatct aagacttgnt tattctatag ttcnagganc 
acaatggaat atgatccctt aactcctgat ttgggatcat ctgaaggacc aaggnnggca 
gtcttcgaag tggaataaaa tagccccggc ngtngtgact tgcacctata ttcccagact 
tttgggaggc naannttnga aggattgntt gccct 



540 
600 
660 
720 
755 
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nttgtaatag ctggttteta aannntngnt ttcaacccct ttgcatgatn ccatcgattc 60 
gagcaaatca agatcttcag gtacagttgg accaggcact: ccagcaagcc "ggatccca 120 
atagttaagg caactctttg tttgcagagg tggaagatcg aagggcagca atggaacgtc 
agcttatcag tatgaaagtc aagtatcagt cactaaagaa gcaaaatgta tttaacagag 
aacanatgca gagaatgaag ttacaaattg ccacgttgct acagatgaaa ST99 tctcaaa 
ctgaatttga gcagcaggaa cggttgcttg ccatgttgga gcanaagaat ggtgaaataa 
aacatctttn aagtgaaatt ngaaatctgg anaaatttaa gaatttatat gacagnatgg 
aatctaagcc tttagtcgac tctggtactc tggaanataa cacctattat acagatttac 
ttcatatgaa gctggataac tnaaaacaat agaaattgaa ngcactaaan gtgaattgtc 
atacaagcga aatgaaancn ttatttgana gccngcgggc ttctaacata ttgagcgata 
actttttgca aatgaaagat gcccttcngc tttntgaatt gnaaaatatt ^aacctgan 
agntnanctt agntgaattg aaacttaaat ttgaaccctg nacnanaccg gttaantgcc 
tgttcctgat aaaaanaagc cntnangtgc ttncctgntn gatttanccc ccg 



<210> 603 
<211> 784 
<212> DNA 

<213> Homo sapiens 
<400> 603 



tgctttataa tagctgtttt taaatncntn gotttgcgct cnntttgcag gcatcccatc 
gattcgaatt cggcacgagg gggaoatcag tgatcgtaag totcctgggn «=gttattct 
canatLggt gacggagcta agaottcgag accatctcgt cctttntgta tcgcggaaac 
ctgangaacg agccggcggc ggtgacctgc acgagaagcc aggctaactg 9T9tgaagtac 
catgcaagca tttcttaaag gtacatccat cagnactaaa cccccgctga ccaaggatcg 
aggagtagct gccagtgcng gaagtagcgg agagaacaag aaagcoaaac ^cgttccctg 
ggtggaaaaa tatcgcccaa aatgtgtgga tgaagttgct ttccaggaan aagtggttgc 
antgcttgaa aaaatcttta gaaggnngca natcttccta atctcttgtc ttacggacca 
cctggaactg gaaaaacntc cactattttg gcagcaaact tgagaactct ttgggcctga 
acttttccga ttaagaattc ttgagttaaa tgcatctgat gaacctggaa tacaanttag 
nttcganaag aaagtgaaaa atttttgctc aattaanctn gtgtcaagga ^aatngnttc 
anatgggaaa gccgttttcc ncetttttaa gantgggaat tcttngatga ngncnaattc 
ntnttganct taactgnntt angcagottt taaaaaanta ccattggata aangagtccn 



aant 



<210> 604 
<2X1> 801 
<212> DNA 
<213> Homo sapiens 



actxgggcca gag^daauya 3 

nggaggagtt ttagggcttg gggagaggga ggacacatnt ggaggaaatg ^cttctttt 
taaaagcana naacacataa ccttacaact gcctggcaag cccaatatca cttgtttggg 
ccctancggg actccaangn agccacacgc cccttctgga agggtgtgng catgtnaant 
gtgtgccanc gcgtgggctg gcgtgtgaan atctatnaaa taagtatana tgggngtnta 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
773 



60 
120 
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240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
784 



<400> 604 . „ f-n 

gttncnnctn aacccttttt tgaaatormt ngcttctact ctttggcatn "tnccatcg 60 

atncggcccc gtgtggggag acngacagca ccctttttnt ctggcatttg cccttgangc 120 
tatagcgcct cccctctccc ctcagagggc acagctgcag gcctgaccaa ggccacgccc 
ggctctcgtg ctctaggacc tgcacgggac ttgtggatgg gcctggactc tccagaaact 
Jcttgggcca gagcaaanga aaacctcttg ttttaaaaaa attttnttca nagtgttttg 

333 . . ™^^o*-T^t- rrrrarrcraaata CTCCttCtttt 



180 
240 
300 
360 
420 
480 
540 
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ntatatgtgt ntaaaataaa ngantggaca tatttggncc tctgngnana nncttngaat 600 
ctaagncaag agtnnntctn gaaaaacnaa ananagtnct ntntanannt ttacgtaatn 660 
atcaatactn tntccacntn accctnctnn tanntntncc natatantcg antaattcnt 720 
cactcntnna ttcctngtna acacnaatna atnnaactat naaatatntn tnctnnntan 
tngacatann catncnnncc g 



<210> 607 
<211> 788 
<212> DNA 
<213> Homo sapiens 



780 
801 



<210> 605 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 605 

gnttctaatg tggttcnaan acttgctttn gctcctttgc aggatcccat cgattcgaat 60 

tcggcacgag agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa 120 

aggggacttg ttaccaggcc attttctgga tggcctgtga gatctctgcc cctccaagac 180 

cctccaagtc tgagcctgac ccacagctgg gacactgaat tcagccctgg gaaccatggg 240 

ggcttctatc tggcaccagg ctgcagcctc cccaatccca gcccactttcr ctgtgtctch ™n 

ggcgggctgt cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc 360 

ctggccttca cccatnccta gccagcagct cccagtcaga caacagccag aaatgtctcc 420 

agactctgcc cagcctccca ggtagccacc ctcgagacat gacctcagag tctctgtgtc 480 

tcctagaagc ctgacagaga cccccanggc agtgggtggg tggcgggcta gagacccttg 540 

cctgtgtccg ggaccctggc gccgntcttc cctcctgtgg atcccttcgc acttacaagt 600 

gttctnaant gggcagacgc ctgggcaccc cttgggccct gcccaancat ggccatngng 660 

cangcttttt naacccgcat nggntttcca ngcctggtga atcttgcttt tccanggacn 720 

nnttggaacc tttcctncgg ggcggggccc ccnagcnct 759 

<210> 606 
<211> 809 
<212> DNA 
<213> Homo sapiens 



60 



<400> 606 

tctncgtnaa tcnnnnnttt aaaagccttt gcttttgctc nctttgcttg atcccatcga 
ttcgtgactt tgtacctggt ccaagctgat ggggttttgc tgctgttgac ccaggcagga 120 
gtctgactag agaacaaact aaggttgctg caacaaacaa ggacctcttc caagaagggc 180 
tcccaggcct ggcgcagtga ctcatgcctg tgatcccagc acttgggagg ccnaggcggg 240 
tggatcattt gaggccagga gttcgagacc agcttggcca acatgatgag accccgtctc 300 
tattaaaaat acaaaaatta nccaggcgtg gtggcgcctg tagtcccaac tactcaggag 360 
gttgaggcag gagaattgtt gaacccggga ggcggangtt gcaatgagcc aanatagcac 420 
cactgcactg catccttggg tgacagaagc gagactccat cttaaaagaa gggctcctgt 480 
gtctacgtca tggtggggct anagagangt cccngcagct gggctgtgtt gagtganngg 540 
ctnntctttt naannccagg caatagtttg tcttgactct gtccttttct gngtccacat 
gacattttac atntttncnn agtttnctta atttaaagtt gnctaatttt accattatac 
attttnaatt ggcatttctt ttaccnatnc tttttgtntg aaaatggtan tntttgaaat 
cngnatcngt tctaatgngn tntattttna ccnaatgcca atnntacctn ctttgnaana 
atntattcgt tttcnaagnt tnaacctct 



600 
660 
720 
780 
809 
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180 
240 
300 



540 
600 
660 



<400> 607 

tntttctaat acnagtttnc aagncttgct ttnnnatccc tttgcaggat cccatcgatt 60 
cgcaaggccc gaggtgccat cccctctggg aagcagaagc ctggtggcac ccagagtggg 120 
tactgttcgg taaagagctc accctctcac agcaccacca gcggcgagac agaccccacc 
accatcttcc cctgcaagga gtgtggcaaa gtcttcttca agatcaaaag ccgaaatgca 
cacatgaaaa ctcacaggca gcaggaggaa caacagaggc aaaaggctca gaaggcggct 
tttgcagctg agatggcagc cacgattgag aggactacgg ggcccgtggg ggcgccgggg 360 
ctgctgcccc tggaccagct gagtctgatc aaacccatca aggatgtgga catcctcgac 420 
gacgacgtcg tccaacantt gggaggtgtc atggaagang ctgaanttgt ggacaccgat 480 
cttctcttgg atgatcaaga ttcantcttg cttcatggtg acgcagaact ataaagccct 
gtgtncactt atagacagtg aaaacccacg ggtcttcatc tttattaatc nngaaacctt 
ggaatgcctg ctttgttttg taaccccttt ttaaaaccta cctgttttta aaaagtggtc 
atttttantt nacgntttan aaanaaaaan tcctatttct ttttcctttt nattttaaaa 720 
aaaaattngn tttttgttgg ggggntttgg ggggaattaa aataatttgg cccccaactt 780 
taaaaaat 788 

<210> 608 
<211> 796 
<212> DNA 

<213> Homo sapiens 
<400> 608 

tcttttaatg ctttttncaa gccttgtttn aaatcctttg caggatccca tcgattcgaa 60 

ttcggcacga gactaccccg gctacggttc ccccatgcct ggcagcttgg ccatgggccc 120 

ggtcacgaac aaaacgggcc tggacgcctc gcccctggcc gcagatacct cctactacca 180 

gggggtgtac tcccggccca ttatgaactc ctcttaagaa gacgacggct tcaggcccgg 240 

ctaactctgg caccccggat cgaggacaag tgagagagca agtgggggtc gagactttgg 300 

ggagacggtg ttgcaagaga cgcaagggag aagaaatcat aacaccccca cccnaacacc 360 

nncaagacag cagtcttctt cacccgctgc agccgttncg ttccaaacag agggccacac 420 

agaatacccc acgtttttat ataaggagga aaaccggnaa aanaatttaa aagttaaaaa 480 

aatanccttt cngttttaca ctactgntgt agactcctgn tttcttcaan cacctgnaga 54 0 

ttcttgattt ttttgttgtt gatgntctct ccattgcttg tngtttgcnt gggaantttt 600 

atttaaaaaa aaaaaaaatt cttgtgagtn gactttggnt tttaaaccan tgntagattt 660 

taacngnacc cttaatgggt tgtacntata tgntttnaaa acatgnnaan aaatatttaa 720 

tgtaaaggnn ctgttnntaa atntaaccac ntanagaant tnnaaannnn ttnanccctt 780 

tagaacnatt nntgng 796 

<210> 609 
<211> 790 
<212> DNA 

<213> Homo sapiens 



<400> 609 

gnnntttaaa nacctntttc aatncttggt ttnnaatcnt tttgcaggat cccatcgatt 
cgcatccagg gagaacctcg gggctgggac acctcctggc cctcaccctg ggtcatgttt 
acagtcctca gtgccccaca ccggtggccc cctgaggaca cctccaccct gaccttgatt 
ttcccaaacg ctgcctcttg gtgacagact cagcccaaaa ccccttcctt ctgtctctgg 240 
agacccttga gcttggggaa atatggaggg gtgtgtgtct gcaatcaagg cctctgcagc 300 
tcacggctgg cccggtgggc tgggacttcc gtctgaattt taaatactta gggntcattt 360 
tttttctctg gcaacaaagc ttgatgtttt cactgcttta gtttcctgtt tgctggtggg 420 
aggggatacg gtctgtgact ctggacttgc tctgggggaa cagttgtcac tgcccccngg 480 



60 
120 
180 
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gagaggggca gcttgggctt ggaagaaagc acaccccnga gaccagagcc ccttcnagag 54 0 

ggatncttgg ctgcttcatt gnctttcccc cagcaagccc tgctcttcca caagcncctt 600 
ntggggtctt gggtatggtc ccccgntcac cttctttcca nantccctga nntggtgtag 660 
ggttgtgggt tggcacangg aattttgggg cattggggaa ggggntttca aaacttttnc 720 
caaanacccc cgtgttcctn ngnaaaattn aanttggtgg gcttngggtg ntnaccccca 780 
antcttngnc ?9Q 

<210> 610 
<211> 786 
<212> DNA 
<213> Homo sapiens 

<400> 610 

gatgtttnnn annctggttc taatncttgg aaanctncnn ctttgttann ngcnntttct 60 
gcaggatccc atcgattcga attcggcacg agcccagctg gacctggtgg ccctttccta 120 
gtgcctctgc tgggggagga gaacctctgt ccacgtggag gctaggaggt ctcaggtgct 180 
gccctggcag caccagagtg tgggccgggc ccgagtgtct gcccctcggc cctcagggtg 24 0 

yggcacttag cacccagaag ggaccaaaag cagggcatgg cggtgcagag gagtttggga 300 
ggtgtaaaca gccccatgca cgtggaggag gagctggctt tcagccccag accccacgct 360 
agcactttcc acgctgcttg cccgctgttg atgtgcagtt cccagtgcct gtgtgagccg 420 
acatctgctc agtcctatcc ctcgtcagcg tgtggagacc cagctcctgc aagcccttct 480 
gcttccacgc ccccagacag cttggtggag ggtcctgcat ctgggccaag ctggggtgca 540 
cccagccaaa gacaaagctg ccttcacgtg cccaaaggat tcaagatggt gcactggccc 600 

cgggaggagt cttgaccaaa aatgggagcc cgctcttgtg gggaaanccc cgacttcccc 660 

caccnanaaa ccgntcccac ggtgccggan cttccccctt ttcctttgtg ggggcaacaa 720 

nattggcctt gggcnctttc aattnttncg gaagctttcc tgggtgtngg cttttgacct 780 

taaaat ?Q6 

<210> 611 
<211> 938 
<212> DNA 
<213> Homo sapiens 

<400> 611 

tgttttaaag ccctntttng aatncntggc ttncgncccc ttggcaagat cnctctctgc 60 

aggatcccat cgattcgttt gtatttttag tagagacagg gtttcttcat gttggtcagg 12 0 

ctggtctnaa actcctaacc tcgtgatccg cctgcctcga cctcccaaag tgctgggatt 180 

acaggcatga gccaccatgc ccagccaaag atcatttttt tatatagact tcagnccttt 24 0 

gtaaatattg taactgggga gtatagagta gaaaaaaagt atagntaaaa catttgttct 300 

acaaattaac ctttaaaaat ataattactg ctaaaaatag agtgctgtta cacttaagga 360 

aaattagtgc cattttggaa atgagatctt gtgccataaa tncagctgaa ctgaatataa 420 

atgttcacaa attaatgctg tnaaaggaat gagttaagca gaaaaacttt taaccagcac 480 

ttctcaaaaa anaaaannna nnaattaaat nntataancn ncatnnanat ntatnntann 540 

tttncntctn nattncanta attttgtntt ncaaatantt nnacctnnan ctntgttntn 600 

nttnnnncna tnnantatcn ntttatcnan tatatnatta nctnattntn nngnanngna 660 

tcntnctcta tncnnnatnn tncatatnnc gtccnntnnn nnaantatgc ctcatnatat 720 

ntacnnnaaa ngtntangta tgnttantgc atnnncatna ctnntgatgt cnnagtnnna 780 

natattttgc cnctcattat tntgctnatn tatntgtttg acacannata ctnnnancna 840 
ttcatcttct cgcaatnngn gnacttttna nttacnnnna tgntannnnt natatatnta 



900 



tcattagana ccttttnaat tntnnntncn nanacgcg 938 
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<210> 612 
<211> 771 
<212> DNA 
<213> Homo sapiens 



<400 
tgtttgnaan 
nctgtntgat 
atatttntcc 
ataatttctg 
ttgttctctc 
tgtatttctt 
atgtgacagt 
gatgctacct 
aaatatagta 
agacagctga 
ttaaatgtac 
ggtgacanag 
ctctgntcat 



> 612 
nncggntntt 
cccatcgatt 
tgctcgagac 
gaaaagtcat 
ttcctcgaac 
ttattattta 
gggaaatcat 
tctctctttt 
tgagtccagt 
gttataacaa 
aagagcaang 
ctaaaacaan 
ngaatatagc 



gaaatncntg 
cgaattcggc 
acatgatgtt 
agtcattatc 
tcctgttagt 
tttcttgtct 
tagtgactta 
atttctcact 
tactggccta 
atgagagtac 
cattctacct 
acaagccnaa 
ctgaaaaatg 



gtacnnaaac 
acgagataga 
tcatgtatct 
tctttaaccg 
catttgatcc 
ctgctacatt 
gaaattccag 
tcaaaataaa 
aggagctaaa 
tgttgtgtga 
gactgngtta 
ccnanaagga 
gntaatcaag 



nctttngnaa 
aactaggcac 
gtggcttttt 
ctccctctct 
tccatatctc 
ttacattgag 
ttggtcattg 
atttgcaaaa 
agcattctgg 
ctgcattaat 
ttgagctctg 
aaaccccagc 
aaagtnaacn 



ancncccctc 
tgatttgttt 
atagtttaaa 
tccattctct 
tgaatatttt 
taaaagtggg 
ggccaatttt 
acaaaaaatt 
gtttgtatga 
tattcccttt 
cancatacat 
tttagggata 



60 
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420 
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660 
720 
771 



<210> 613 
<211> 774 
<212> DNA 
<213> Homo sapiens 



<400> 
tttgaatcct 
gaattcggca 
cttaaagtaa 
ctaataattc 
ttccatctta 
aaagttattt 
aaaatccacc 
aactttcana 
tagtctcata 
nttcttaatn 
gcataacctg 
ncccnggnca 
cttggaggtc 



613 
tgctttcaaa 
cgaggtaacg 
taatacgtca 
agactaaagc 
tttctgatgt 
acgtgaaaac 
ttctgtcaca 
aaagaaaaaa 
tattcattcc 
atnattttta 
gactggttga 
ggnacaaaaa 
caacccccaa 



tncttggcac 
tgacacgtat 
tggccctgct 
tatttatatt 
atatgtttta 
aagatttttg 
ctatagaagt 
atggtcaatt 
acaccccccg 
aattctgacc 
agnagtaatt 
nttntaanga 
aaatcattag 



tngccctctc 
tttacttctt 
ataaggtagt 
gctgtgacac 
ttttctctgc 
tttgagttca 
atattttgaa 
tantttaact 
ttcttctttc 
aaaaataaag 
ctgntccttt 
acatntggga 
gncnaccaaa 



tgnaggaatc 
ttantaggcg 
agttctagaa 
cacgtggaaa 
cttcataaga 
tttacttgag 
ttatcaaaag 
ctatgtcaaa 
cttctttctc 
tngtggcaag 
aaaaaaantc 
attangcnaa 
attnaaaata 



ccatcgattc 
gacacacttt 
gactgtntat 
acttttataa 
actaaaaacc 
atatgtttaa 
gtagaattat 
aatttattta 
cctctgcctt 
tactttctta 
cccaactggg 
atggatnttc 
atcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
774 



<210> 614 
<211> 754 
<212> DNA 
<213> Homo sapiens 



<400> 614 

ttggantctt ctcngaaacn cttngcnatt gcnctntctg naggatccca tcgattcgaa 
ttcggcacga ggttcttcaa agccaaccaa gacaggcttn tnagttttag agcttcagaa 
caaattgcca aaagccagag ttgtttatgc tagtgcaact ggtgcttctg aaccacgcaa 
catggcctat atgaaccgtc ttggcatatg gggtgagggt actccattta gagaattcag 
tgattttatt caagcagtag aacggagagg agttggtgcc atggaaatag ttgctatgga 



60 
120 
180 
240 
300 
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tatgaagctt agaggaatgt acattgctcg acaactgagc tttactggag tgaccttcaa 360 

aattgaggaa gttcttcttt ctcagagcta cgttaaaatg tataacaaag ctgtcaagct 420 

gtgggtcatt gccagagagc ggtttcagca agctgcagat ctgattgatg ctgagcaacg 480 

aatgaagaag tccatgtggg gtcagttctg gtctgctnac cagaggttct tcaaatctta 540 

tgcatagcaa tccaaagtta aaagggtttg tgccactagc tcgagaggaa atcaangaat 600 

ggaaaaatgt gttgtaattg gtctgcantc tacaaggaga agctangaac atttagaaag 660 

ctttggaaag aaggccggng ggagaaattg aatgattttt ggtttcaact nccaaaaggt 720 

gtgttgcnct cccttctttg aaaaaacatt ttct 754 

<210> 615 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 615 

tgtttnaatg ctgttttgaa atcttgtttc aaatcctttg gctacttgct ctntctgnan 60 

gatcccatcg attcgaattc ggcacgaggg attctttcac tgacrcacaaa gaattottocr i2 n 

ggctttagca tctgactgat tttgttacgg ggttgattct gaccatagga agtatgcaat 180 

gtgaatcact atttacagag aaacctacaa cagatgcttg atgttgtaga aactgggaca 240 

tatagatacc aagcaaaatt ataagaaacc tataaggtgt tcaatacgct tgtgtttcca 300 

aaattcactg tncatgatca gtttggtgtt cttgtaccac agtttttaac tgaaggaacc 360 

agttgtaaca gtctcaattt ttaactaaaa cttgaagaac taanacaaca atgcaaacct 420 

ttcagoattg tttggccaaa cttgttaaaa ctgtaatgca agaaccaaat gcactgtgat 480 

gtggcaccaa ctaattagca agcatgaatt tttcacccaa nagtgaaaaa aggaaaatct 540 

accatggctt naagtttaag agcagaactt cctgactncc attctatgac tgatcaaaaa 600 

nactaatagt ttaaaacctn agcangcctt gttcacgata tgcngaaaaa aaaagtgctt 660 

gcagtttann atccttatgg aantttttca cantgtnaca nggtnttgta atacnttgga 720 

ngccctacat tttcntanga atntattttn cttggcctaa nttggnntca angc 774 

<210> 616 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<400> 616 

atnncntttt tgnaatcctc tctgaaatcc tttgctactt gctctttntg caggatccca 60 

tcgattcggc cagtcctcac cttccctagt cctcgtgtgt attttaggag atgcgtgggt 120 

gtggaacagc ctcctgcctc cggtccaggt gtactggggt ctgtgtgttg tgtttctgcg 180 

tgttctcggc agaaagtggc atgctgtccc gcctgggtga tttgctcttt tacactattg 240 
ctgaaggaca ggaacgaatc cctatccaca agttcaccac tgcactaaag gccactggac 
tgcagacatc agatcctcgg ctccgagact gcatgagcga gatgcaccgc gtggtccaag 
agtccagtag tggtggcctc ttggaccgag atctcttccg aaagtgtgtg agcagcaaca 
ttgtgctcct gacccaagca ttccgaaaga agtttgtcat tcctgatttt gaggagttca 

cgggccatgt ggatcgcatc tttgaggatg tcaaanagct tactggaggc aaagtggcan 540 

cctacatccc cttnctggcc aagtcaaacc cagacctgtn gggtgtctnc ctgtgcactg 600 

gtggatngtc aanngcactc ttgtgggcca cacaanagat tccctttttg cctgcaanac 660 

cntgtntgaa acccccttaa cttatngccn atttncntna agcaaccctt aggcnanttg 72 0 

actnncnttc acaanttttt ggggcnaaag anncnaattg gcctgccct 769 



300 
360 
420 
480 



<210> 617 
<211> 766 
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<212> DNA 

<213> Homo sapiens 

<400> 617 

aganntcttc ctttctaatn nctngctacn ttctctntct gcaggnatcc catcgattcg 60 

cttcctcaaa gcatggttgc tgagnaccca nagttgcgag gngttttttt actgatttag 120 

ccaggtggca atcatgagtg aatggatgaa gaaaggcccc ttagaatggc aagattacat 180 

ttacaaagag gtccgagtga cagccagtga gaagaatgag tataaaggat gggttttaac 240 

tacagaccca gtctctgcca atattgtcct tgtgaacttc cttgaagatg gcagcatgtc 300 

tgtgaccgga attatgggac atgctgtgca nactgttgaa actatgaatg aaggggacca 360 

tagagtgagg gagaagctga tgcatttgtt cacgtctgga gactgcaaag catacagccc 420 

agaggatctg gaagagagaa agaacagcct aaagaaatgg cttgagaaga accacatccc 480 

catcactgaa cagggagacg ctccaaagac tctctgtgtn gctggggtnc tgactataga 540 

cccaccatat gggtccacaa naantgcagc atctctaatg aganttattc ttgcccttng 600 

ttcaangatc ttattgaaag gacatcttac agcttttccc aatgagaang cccangaagt 660 

gttaaacata ctgnnttgaa aaaagcactn tatnttntcc cntnttaana tggtntctaa 720 

aatgtanaaa naaannaaaa naaaanctcg atccctctnn aacnct 766 

<210> 618 
<211> 762 
<212> DNA 
<213> Homo sapiens 

<400> 618 

tttnnagnnt cttcctttct aatggcttgg ctactngttc tttntgcagg atcccatcga 60 

ttcgctcagt gcagcgatca tggctcagtg cagcctcaaa ctcttgggct caagcagtgc 120 

tccaacctca gcctcctgag tagctaggac tataggcaca cagcaccatg ccccggctat 180 

ttttttattt tgtagagatg gggtctcact atgttgccca ggctagtctt gaactcctgg 240 

cctcaagcaa tcctcccacc tcggcctccc aaagtgctgg gattaaaggc gtgagccacc 300 

gtacctggcc cttggtggaa tctttagggt tttctattca tacatataaa atcatatcat 360 

tggcaaacag agataatttt acttcctcct ttccaatttg gatgccttag atttcttttc 420 

cttgcctaac tgctctgtct agaactccca gcactatgct gaatagagtg gcaagagcag 480 

gcatttgcct tgttcctaac cttagagaaa aatccttcag ccttttacca ttgaggatga 540 

tgtttgctgt tagtttttca taaatgatct atatcaggct tgaataaatt tctatttcta 600 

aaaanaaaaa atataacnnn ntanttnatn aantnnttaa naaaanaaaa actggnacct 660 

ntaaaactta tagtngagtc gtttnaccgt anatcccana ntttgataan gatacattgg 720 

atnanttttg gacaanccnc aactaggaat ngcnntgnaa at 762 

<210> 619 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 619 

tttggagntc tttctttcta atncttggct actngntctt tntgcaggat cccatcgatt 60 

cgaattcggc acgagcggac ccatcggagc gtaacctgga tctccgcagg cctggcggag 120 

gccggccacc tggaggggca ttgcttggtt cgcgtggtag cagaggagct tgagaatgtt 180 

cgcatcttac cacatacagt tctttacatg gctgattcag aaactttcat tagtctggaa 240 

gagtgtcgtg gccataagag agcaaggaaa agaactagta tggaaacagc acttgccctt 300 

gagaagctat tccccaaaca atgccaagtc cttgggattg tgaccccagg aattgtagtg 360 

actccaatgg gatcaggtag caatcgacct catgaaatag aaattggaga atctggtttt 420 
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gctttattat tccctcaaat tgaaggaatn aaaatacaac cctttcattt tattaaggat 480 

ccaaagaatt taacattaga aagacatcaa cttcactgaa gtaggtcttt tagataaccc 540 

ctgaacttcg tgtggtccct tgtctttggn tataaatgct gtaaggtggn agccantaat 600 

tntctgcaan aagtangnca gcacttttca gtgatttgaa tatcatcttg gcttngangc 660 

cangtggaca accttgtcat aactgacttc tgaaaagaac cctntngata tttgatgcct 720 

cnggtgtngg tggaactgtc atttantngg anna 754 

<210> 620 
<211> 767 
<212> DNA 
<213> Homo sapiens 

<400> 620 

gcgttctttg aaagccctnt tttgaaaggc ttgcttctaa ttacgggaaa cctttgcaac 60 

tgcagatccc atcgattcga attcggcacg aggacccagg tagaccagct caagagttca 120 

tgttctttgt natcctcctg tgagctctct gtaagtcnnt ttcttgccca tcaccacatc 180 

cctagtactg ggtatcagtc tggccacttg gctttctggt ttgccccaat gtggtctatt 240 

uttgatgcag ctaccaaagc aatgttttaa aaccattata ccaagttact atccttgtca 300 

aaacccccag taactgccaa tctcacttag aataaaatcc ggactcctgt gaagcacagc 360 

ataaactggc cactgcctat gcagcaacct catctttacc gnttcctgcc ttgctcactc 420 

ccttccagcg ccgttattct tcctgatgcc cctagtacac aacaactcct tcctgctcca 480 

agagtaggaa aattactggt ctctctgcca gngagaancc tcttctggna ttacctttgc 540 

ttcattgcng aatcttctnc aatatcatct tctaaaaaga gccttttaaa aatcaccttt 600 

nctatnatgc cctactcatt tccagtccct gaaanggcca ttcccacttn antannactt 660 

attgctaacn tgaaatacac taaatgnnan ccttcatgaa nggtanggca anttaaatgc 720 

nttngcactg gnnaggcnaa gagaacaagc ancntggntt canaagn 767 

<210> 621 
<211> 828 
<212> DNA 
<213> Homo sapiens 

<400> 621 

tttctaatag cttgctttct aatnctnggn aacgctnggt ctctgnagga tccctcgatt 60 

cgaattcggc ncgaggggtg acagagtgaa actcgtatct ccaancaaac aaacaaaaag 120 

tncttaaaca tatgtgaaca aaaatttngt gatggaagga ttctagttaa tgagtattgc 180 

atcaagattt acatctttct tactaaggaa aagagttaat aaaaatngnt ctttatttta 240 

caggcagnta ctgaggctct tcccanntcn cagtanacag ccactcagcc ttgaaaatgg 300 

agtgttgttg tttctaaaca tatatttatg tcatttattn aggtacagtt cacttaaata 360 

accataagtn gantctctct tgtnagtgat ttgggtagga agaggccatg tctanagttc 420 

natttctctg ttgggtccna ntgaaattgg accttttnag ttgttcanaa aaatnaanat 4 80 

aaattnctca tattaaatca agannctcnt caanttatag atgtggggta gggttccnng 540 

taaaacccat tatnaatcta gaaaattatc nctatngana angcntttaa tatctnttac 600 

cntgaaattc attactttag tncaaggcct acctttaaan gtttnnncnn gaaccatttt 660 

tannnnttcn ncttttgnnc caananntca ttttaancca ccaaaantcn caattnttnt 720 

tncattnnaa tannggatgn naattatnnn atcnatgtgt catatttnac canganaata 780 

ctgngctncn tgnaataatn ggtacactaa anncnngann tttnntcn 828 

<210> 622 
<211> 784 
<212> DNA 
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<213> Homo sapiens 



gtctttgaaa cctttttcta atncttgctt tetaatnctt ggcnactcnn ctctcnctgc 
ag^cccatc gattcgtttg ctttcagtgg ttggctttca ctgaaagaaa g^taaanaa 
ag^agaatt tatagctttc actatgtcca agactaggac tgggttataa ^""tctt 
ttgtgaagga aaataaaaga aaatttgcca ctactgcatt tactttacta "gtaaactt 
aagattcatt ccttagtctt tggaattttg atgtctcaaa accagatgag tggaagtgct 
gSttgcaa aataaagcta agaatgctta actctgcact ttaagttcta ctctgaccaa 
S^gaaStg agcagagcag ccctgaacag catttngttt atacagtctt ? tttaagaat 
agaatttttt taactcttca tttnttgtct ctgtggaagc tgtgtaactc "tttaaaat 
gcaatttaaa acattntggt attctaacaa ttctctcaan aaacagcatt tccaatggna 
atnggtattg ntacgctgta ccttatgtat tncctgtacc tgaacacttg atgctgcctn 
acangaaaa? agaactttat gttaaaaaat aaaagtctgg tncttctttg naaaaacaac 
Scn^tnctn cLnaaatcc ncnacannnc tnnnaatntn ctaanntnag tctnnnttnn 
ngcanncttn trmncccnct nanctccctn tntcntnttc atatctanan tnacanccct 



ccct 



<210> 624 
<211> 798 
<212> DNA 
<213> Homo sapiens 

<400> 624 



60 
X20 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
784 



<210> 623 

<211> 1164 

<212> DNA 

<213> Homo sapiens 

gggacttntrangccntttt cgaaatccnt tncttccnaa tcccttngca actntcnnct 60 

nStgcaSa tcccatcgat tcgaattcgg cacgnagnga gcnnattcnc »0 

ttctntttct ntnatnnaca ngngaaantt ccaggnnatc ntgnnnccnt atctgantna 180 

ngctngnttn aacntngnna caccnngnct nnnaancaaa tttnanaaaa 240 

nanancatnn narmtimcca atctaccaaa atcanaacac ncantgaaca acacananna 300 

tnnnatacnn tctacnccaa ancnncncat nncacgcacg ataanacanc ™ 

ancnaancan atatcanann caaccntana cnannaatca nacnctnanc tcocncacag 420 

canngngacn aanaacnanc antgataaan cncacctnnn tannacacac otnannancc 480 

ZSELtcc cgaataacca atngccaoxm ctannccnat aacanantcn ctxuaccctc 540 

rtmatcaaa ttantaaatt cncnancata aagnanatca cagcctcntt cnaccnntga 600 

fcXan'n* an^cnangn nanncnntat naaacnctat ancantnnna 660 

nnatcngcnc ntanaaatta aanatcnaan actcaatatn ^cggaatant ^tctcta 720 

nataannnta naacggngna aanacncctc anacataann gncntacnna tcgatctatc 780 

Lntnancat aaagtcaccc gcatattnac cnacgnncaa cataannnaa *tnctactct 840 

caaaccatat aaatntcgcn tccntanatc agngcnanan tacaaanacg tcgcnnnngt 900 

ntggacSca cgncntagat aaacacnnat aaacantttt tanatgtaac acatttcnna 960 

^Stnaaat aSatcattn atgnanacga tnacaacaaa nnctacncna tgntactaaa 1020 

nacaantaaa nntnanatta aaaaagttgc aannatncng ngaaanntcc -anaaacan 1080 

tanatncnta tttannnntn acnncggngt nnccntaaaa anaactctnn nntnnctggn 1140 
ttgtanatnt annncnanct cgcg 
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ttgttaagcc tnttttcnaa ntccttcctt tnaaatcttt tgnaaacctt ggtanttgca 60 
S^f^' ° 9attCgagt ^gcatcct gcctcagaat gactttccta S«tgct?f 120 
tgtgtcattc caaggtttct tcatgagtca ttccaagttt tctagtccat accacagtgc 180 
cttgcaaaaa acaccacatg aataaagcaa taaaatttga ttgttaagat acagtagtgg 240 
c C a C a C ; C ' ta ttcagtcaat taagagtaag tttttttatg tggttattaa aacagtatga 300 
acaattagtc taactctgca tagacagggt ctagattttg ttaacccaaa tgtataactg 
^!"!? C aaattacaat ttgaagtctt gtggnttnta tatagctngg cactttatta 
ctcttttgaa ctgaaagcac actcccttat aggttcatgt aactgtcctg taataaggtg 

tttta^ ff a T a T Cacagccta ^ "ttgncaca acctttagca tctaaaaaag 
ttttaaaagc ttcttaaatg nctaatataa anggagatgc tnatanccac aacatctatt 
It,, tagtttcctt acacttacct tgggantttg cattgagtga ngttttngta 

ttncttaSa Sn^*" 3 ""ggtacgt tttnatgact ttaatccann 720 

ggg^agg?? SSST aaaan9Cttn ««W»t ggaantnnna ntnatt t „ t g 780 



<210> 625 
<211> 793 
<212> DNA 
<213> Homo sapiens 



<400> 626 



It^ 9 T Ctttgnaaat =cttntttet aatntntgga tccttggcna ctcgctntnt 
ctgnangatc ccatcgattc gaaacggcnc taggaatcat cgaaggttga gaccgtgacn 
anttacatag tgatnaatac ccatctatgt actgnngcct nctaaatgtn tntctncnnn 
SCS? cctttaanct ctagatccat tgacancctg ancatntcta aaaggca^ 
ngaaactgaa cacatctgat acagaactct gcattnnctt ccnaantntg cocaLccna 
gcctgntcct nnttcacgct tancacttat natatgatcc cactattcac tnantctctg 
aagcttaaaa cctangattc atgcttgact actgnataat nntacaatct actcctaatg 
cattagcaat tcttgctagc tctaccttca aaatatattc tgaatagact atntcttgcc 
gnttcccttg cctncncatt tcccatctgc accccttctc tnctncccaa aatcaataca 



360 
420 
480 
540 
600 
660 
720 



798 



<400> 625 

ttcttaagcc ncttttctaa tgcttgcttt naaatctttt gnaancgctc ggctntntgc 
StacaT" CCga " Cgaa ggaaatgcct ctatgtangt glagtgttct 

HIT ?° aaCa3taaaa ^"aatataa tattttcccc acaaaagaaa cac? t aacag 
aggcaagtgc aatttataaa tttatatcta aaggggaatc atgattataa gtccttcagc 
ccttggactc taaattgagg ggattaaaaa gaatttaaaa taattttgaa cgaatttaJt 
:= C a C Ca9 tttttgaggg cattaaaaag gcattaaatc aagacaaatc atgtgcttga 
gaaaaataaa attaatgaaa acacagcact tatgttggtt tagctgcagc ctccttggag 
S tta tttatttaaa attactggtt gcatcaagaa ccccataggg tgtacaXag 
gttctataaa atctgcatta tagagacaaa gangcaggca aatncatgtc acaagggtna 
agcttacagt ttacaaactg gggaacgccc agggtgtang atttnaaaaa cgncactctt 
!^ aaaa ° an at 9 taatcan 3gntgctgaa aactttgcat ggnggctttn aagaoattta 
gnccttgttc aaaccaaaat ttnttggnat ttgccagatt ccttantntt gccatgggcc 

g™:: t£ t99CCtt tat9nCnCtt taaaattttn aattaaaaat ™La 780 

793 

<210> 626 
<211> 825 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
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ctagntgttt ctaaaaaaaa tatnganann tagnnnaaaa ncntaaataa atntaaaana 
angStancn tnacanaana ttnctaatat aggnnanntn ntgncaanaa cnntaantnt 
tnaatacgnn aaaactctct cnaanngann aanntatnnn agttaaaagn naaaUnnnn 
aanantncca aatntanaag ataangncat aannntatna gncnnaacgc taantgnnga 
tganntntaa tnngnatana nnantngtta nnacaaaatn tacnn 



<210> 627 
<211> 772 
<2X2> DNA 
<213> Homo sapiens 



tctacanqac ccw^yati. w 3iJ v*«« ^ ~ — 3 ^ — 

agagtatgac ctacctcaca gnagggatgc tggaccaacc tggctttccc gactgctcca 
tcgaggcagc catggtgaag gtgttcanct ccgaggccgn ctgncagtgt gtgagtgagg 
cnctgcagat cctcgggggc tngggctaca caagggacta tccgtacgag ^tactgc 
gtgacacccg catcctactc atcttcnagg gaaccaatga gattctocgg atgtacatcg 
ncctgacggg tctgcagcat gccggccgca tcctgactac caggatccat gagcttaaac 
aggcSaagt gagcacagtc atggataccg ttggccggag gcttcgggac tncotgggcc 
naactgtgga cctggggctg acaggcaacc atngagttgt gcaccccagt ^tgcngaca 
gtgccaacaa atttgaggag aacacctact gotttanctc ngaccgtgag ^acttgctg 
ntnccntttg gcaaagacca tcatgganga ncanntnggt nctnaancng nntggccaac 
atnctcatca acctgtattg gcatgnaccg cncttgctgn acnncngngc caaancnctc 
nantccgcca ttggggcttc cggnaaccac tnnacaccaa ggttctnttg gc 



600 
660 
720 
780 
825 



tttttaatgc'ttngtcgnac ttctcccagn aatcgnttng aaactcngcn ^tcgttctc 60 
tctgcangat cccatcgatt cggaaatttg cactgatggc tcanaaggct tacgttttgg 120 

a 3 , 4 -,~~r>^^r+ <-rrfTrihf.t-.ccc aactactcca 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
772 



<210> 628 
<211> 808 
<212> DNA 
<213> Homo sapiens 

tcnctcgiarcntttnannc ttggctactc gntctctctg caggatccca t^tcgaa 60 

ttoggcacga gatgacatcc tcattatcca cantgcaaag ccaaccatcc ctatgatggg 120 

ttcattgtgg atcatgactt antgggtcaa gagtttggaa gtggctcagc tgggcggnct 180 

tctgctncS gtggctgcca natggtnccc tgctggtnng cagnctngtc tagagggtcc 240 

atgatggctt tactcacatg octggcatct tgacagggac agctggnang « aa 9™^ 300 

ctgggStgt ncacagagct ncttcntgtg gcctttccag catggtggtc taagggtagc 360 

tggacttnct gcatnacagc tcagggctcc cagagctact gtcccaagag atnnaaagtg 420 

gnaactgnca atcttttang ctaangncca gaaaccatta cccctgcacc -acagtctt 480 

Ltnta^ctg ntgaaataaa cattnnnttt atcaattnta ancattcgca ^attggaatt 540 

aaataccttt tactaatttt gncgtgacoa tctgcccctn gttcaagatc taaaaaactt 600 

ttatngntca tcntgnngat ntaaaaaact nttgtgttng catttanaac ^ntaagcan 660 

nttnggcant tanannnaan annttrmnaa acccttntat anaacottat taagttgang 720 

catxmgnant ttcnccttna aatccnaggt ccttagggct angnnatacc nttcntatng 780 
naactttngg gaacctaaan cctctcct 

<210> 629 
<211> 827 
<212> DNA 

<213> Homo sapiens 
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<400> 629 

ggccnncttt gaaccttntt caaatcnttt ggcactcgcc nctctctgnt ngntcccatc 
gattcgctgt gatccaaggc atgaaaagag tgcaaggtaa ncangnggca gcnttnatng 
aagcatnaaa taangcnaaa gcnnatgctn anctnangga gcangnngct aaagacaacc 
acannctanc tgnntnctaa tcatgctntg cttnctnang tgancttata gnaacgcant 
nagactncan gcnttgcttg gcncaacaag grmacctana ntcatnanga agcnnttgaa 
ctaangagtg gctacnncct ttnntnctca tgcntgacct gtaatnattc ttctganttg 
aggcaanagc gggtnnaant natngntnan ntgnaaanac tntnnnatcc gnnnntnctg 
attannttnc attnntntna atgatanann ctcatcnngc tcgncctgna ctttganang 
ctnnimtcnn anntnnctga ctttaggagn nnacctncag cganatgtna agnanngaaa 
ttnanntncc tnncgntccn ccttgcngac tnanngtctt gngnaacntn angtanntan 
tctacngggn gnnacnttgg nnaattgggg ncttataaan tnttcntnna agaatgantg 
naccaattnt nnaanntcta agnttgggga aatctnngtt tcctgnatnn gnacaaaaan 
tcgatttann ngncnngntt nnttgggcnt catntgccat tgatgcnatt cnacttatgt 
cctcntggng cntnttnnaa nnnggnngnn aacatttttt gtgtgcc 

<210> 630 
<211> 793 
<212> DNA 
<213> Homo sapiens 

<400> 630 

ttcnaatgct tggncnngag tccncctttg aacnttttca aatnncttgg caactcgcnc 
tctctgcatg atcccatcga ttcgaattcg gcacgaggcg ngttgttcta cactgcnntc 
ngaagntttn ntaanaagcc accacttagc ngaggcnnct acangtcttg gggncttagc 
gaagagaaat cncgctggca cttgncccgt tcacntaagn actnntgnct gantccnagg 
gtannngtnc accttgngnn ccancagaca nacccaannt gncntaaaan gggcaggtct 
aagcttacnc tngactncac nggcaagctg nangcctgtn ctgccttccn ctgcnntnac 
aaatngacag tnngaccaag agtcanagna aaaactncaa ggatacatnt atcccantct 
nttctacacc tntanattcc ntganctatt gctcanaccn atcgtgcggg caaaggcaag 
acttgggcaa cattnttnaa tacaatgatg ctgacaanta atttccngct ngttgccagg 
natntttacn cgagctnttg tgattccaaa ctaaagaatg gngccnnnan gcccntcntt 
antnctggnc ccccanaang ancctaactn gcgaaaggnn agnatggcat tnacccaaac 
caacttntng gattacnnca actccanaan atccgacggc atnnaanang caaaacaaca 
acttcncnan natnnaanna atngnnccnn aaananaacc cgngcntctn aaacnattgt 
ggacccatnc ccc 3 



<210> 631 
<211> 752 
<212> DNA 
<213> Homo sapiens 



<400> 631 

gnagtnncct tngancctct ntnaaatcgc tttngcnant cgcctctttc tgntngatcc 
catcgattcg aattcggcac gagatgttac agacatgaaa tatgaacaga atnctaaaag 
aacataaaag aataagagct ccttaaagat tataaataaa tggtgatgtt aaagtaatag 
caccattgga cgaagctagg gaatcaacac ttgacagaaa gatacatatt ttttttatac 
aaactacata tatttgagca atcaagtagt agacatagag aattttcttt ttatggaagt 
actctaataa gtaaagggct gatagaatta tatcagcatt ttctagctcc tggtgaatta 
tgcattgggc atccatggct gccttagatc acaaaaatac caccagatat atgcctgtgg 
atgaaagatc acaccaccac ctgtgaaata gtcttcccca caaaaaatcc aacccaaatc 
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ctatccagcc tgtagatggt actcgagatc ttctataaga aataaagaga ^angc^ggt 
cacqgtggat tgtgcctgta atcccagcac tntgggaggc caangcaggt ggatcgcctg 
angtaaagaa gttcnagacc agcctgccaa catggtgaaa cccccctctn tacttaaaag 
taccnaggat gagcccggcc gttgtggcaa gcacctgtgg tccccagcta cttgggaagc 
tgagcangaa aaatcgcttg aanctgggga ng 

<210> 632 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 632 

gnnnnnnttn nnnnnttcta atgcttggct actcgttctt tntgcaggat cccatcgatt 
cgcaactaga gaagattgga cagcaggtcg acagagaacc tggagatgta gctactccac 
cacggaagag aaagaagata gtggttgaag ccccagcaaa ggaaatggag aaggtagagg 
agatgccaca taaaccacag aaagatgaag atctgacaca ggattatgaa gaatggaaaa 
gaaaaatttt ggaaaatgct gccagtgctc aaaaggctac agcagagtga tttcagcttc 
caaactggta tacattccaa actgatagta cattgccatc tccaggaaga cttgacggct 
ttgggatttt gtttaaactt ttataataag gatcctaaga ctgttgcctt taaatagcaa 
agcagcctac ctggaggcta agtctgggca gtgggctggc ccctggtgtg agcattagac 
cagccacagt gcctgattgg tatagcctta tgtgctttcc tacaaaatgg aattggaggc 
cgggcgcant ggctcacgcc tgtaatccca gcactttggg aggccaaggt gggtggatca 
cctgaggtca aggagctcga gaccagcctg gccaacatgg tgaaacccca ttcttttctt 
aaaaatacca aaaaatttag cccangtgtt gaatggntgc atgcctgtaa ttcccagctt 
ctnanntagg ctnanacaag gagcttncnt t 

<210> 633 

<211> 806 

<212> DNA 

<213> Homo sapiens 

<400> 633 

ttnnannncn ttttnaaaag gcctnnnntt gannctttcn aatgcttggc tactngntct 
ttctgcanga tcccatcgat tcgaattcgg ctntagggaa ggggagggtt ggtgagtccc 
agaccttaaa aatacaaggt taagagggac cccaaagcaa aaaattccaa cccttttcct 
cccagtcatt gaaacaccaa aactattata ccggagggtg taatagtttt gctgcccagt 
tgtggtaggc cagtagtggc ctcccaagat gcccatgtcc taatcccagg aacctgtcaa 
aattaccttg tatggccaaa ggggctttgc agatgtaatg aagttaagga tctttcgcca 
ggaagattat cccagcttgt cangagggct tgatgtcctc acccgggtct gtataacaga 
agagcaggtg acgggagagg aggttggagg tgtancgatg gacangaaac tggagttata 
ggagggcagc tnaagccaca gaatccaggc cancttanga gcccaggaaa atgcatttct 
ttccacaaaa gcccttggaa ggccccaanc cctgcttccc acccttggac tnggcttcaa 
tgaggcttaa tttttataaa ttcntggctt gattttagaa ctcntaaggg gaaataaatt 
ttgtgttngn tttaantcan aaaataaatn aattaaaaaa aacttgaanc ctttanaaac 
tntantggaa ttcntattan cttaaanccn aancttggat taaaggatnc atttgtttna 
anttttggga cnaaccccca anttnt 

<210> 634 
<211> 775 
<212> DNA 
<213> Homo sapiens 
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<400> 634 

ngggacttcg cctnacgaac cgctnggaaa tcccntntnt gnaggatccc atcgattcga 
attcggcacg agtataaact ttattttatt ctcttctggt tttgtgttac atgacaagaa 
attgaattaa nncaatanaa ttttagttcg ggttgcttag gtttttactg ctcccattct 
tgcttttact aatttatcca agattagatg tgattactat ttaataataa tttagtcctc 
acacttacaa accacttaca ataccagcat gcttctatca ctgtaattct attcaattct 
caggcccatg aggcatgcca gccagacgac cagacagcat ttatagagtg ggcactcaat 
accagccaca aaagatcctg tgtcagaagg ggaaacaggc ttggaggctt ggagtatgtc 
gtgatagcct ccctccagtc cacacaactg gtactgctgg ggctgaaact agaactcang 
cctatgcctc tcaagctcaa gggtcggatg tccatgtnct tcgcctctag aactatannn 
gagtcgnaat tacgtagatc caagacatgg gtaagataca tnggatgagt tnggaccaac 
ccaccaacct aagaatgcan tggaaaaaaa tgcttaattt ggtgaaaaat ttgtgatggc 
tattnngctt aaatttngnn aaccatttna taaagnctng cnantaaaan aaaggtttaa 
ccaacccaac caattggcaa ttccatttca anggtttcaa gggtccaang ggggg 

<210> 635 
<211> 784 
<2±2> DNA 
<213> Homo sapiens 

<400> 635 

ttgagngtcc tnctttnacc ctttcnaatn gcttggcnac tcgctctntn tgnaggcatc 
ccatcgattc gaattcggca cgagatatag ctctggaggt caggacatag gagatattga 
ttcaggactt gccagagtat ggtcttgggg tgtgccctga tattacaaac agggatctta 
gtggctaggt gatgaggcca tggcaaatgt agatggacca agatcaattt gcctttctag 
atgaggtttt ctaggtgaaa tgtttttgaa actattttgt agcctagtat aatttataaa 
agtagagaga aactataaat ataaatttgg aangggttag ctaaaaggag aaaacagcan 
aatcttcata tatatanaaa tggatattaa tttgctagaa ttaanagact gcaggtaaag 
atagnttttt ttaatacctc tttttgctgt anaaaggaca ggattaaatg atnaagggat 
gctggaatga ggaatggtaa ctttaggcaa gatagtcttc tgngacggct gatatgaaca 
atngagagta anacatttnn aatacaanaa attgtcctgc tgctcaccca tcaagccttt 
tcangtttct tcccttgcca aaantngtaa naacttntgg tacttttnna ncttgtatnn 
ttccngttna ttggttanaa ccccttcgat naanaanncc atantttnaa tttgggnttg 
acccccnagg ttaaaanttn ccntttnctc aatttcccct tttcaaagnt ttaacntaat 
taan 



<210> 636 
<211> 765 
<212> DNA 
<213> Homo sapiens 



<400> 636 

ttnnannctt tcnaatnctt ggcnactcgt tctttctgca ggatcccatc gattcgtcct 
gcgcaggagc cgcagggccg taggcagcca tggcgcccag ccggaatggc atggtcttga 
agccccactt ccacaaggac tggcagcggc gcgtggccac gtggttcaac cagccggccc 
ggaagatccg cagacgtaag gcccggcaag ccaaggcgcg ccgcatcgct ccgcgccccg 
cgtcgggtcc catccggccc atcgtgcgct gccccacggt tcggtaccac acgaaggtgc 
gcgccggccg cggcttcagc ctggaggagc tcagggtggc cggcattcac aagaaggtgg 
cccggaccat cggcatttct gtggatccga ggaggcngga acaagtccac ggagtccctg 
caggccaacg tgcagnggct tgaaggagta ccgctccaaa ctcatcctct tcccaggaaq 
ccctcngccc ccaagaaggg aagacaagtt cttgctgaan gaacttgaaa cttggcccac 
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ccaactgaac cgggacccgg tcatgcccgt tccnggaaan gtctattata aaggagaaag 600 

cttcgagtca tcanttgang gaanaagaag aatttcaaaa gccttcgctt atnttcngta 660 

ttngcccgtg ccaaacnccc cngctttttn ggcttaccgg ccaaaaagaa gccaanggan 

gcccnnanaa cagggatntt gaaaaagaaa naatnaaacc ctcnn 



<400> 638 

anttgntctt tntgcaggat cccatcgatt cgcagcaaag actttatttt tgtacagaag 
atggtgaagt ccaagacggt ggctcagtgc gtggagtact actacacgtg gaaaaagatc 
atgcggctgg ggcggaaaca ccggacacgc ctggcagaaa tcatcgacga ttgtgtgaca 
agtgaagaag aagaagagtt agaggaggag gaggaggagg acccggaaga agataggaaa 
tccacaaaag aagaagggag tgaggtgccg aagtccccgg agccaccacc cgtccccgtc 
ctggctccca cggaggggcc gcccctgcag gccctgggcc agccctcagg ctccttcatc 
tgtgaaatgc ccaactgtgg ggctgtgttc agctcccgac aggcactgaa tggccatgcc 
cgcatccacg ggggcaccaa ccaggtgacc aaggcccgag gtgccatccc ctctgggaag 
cagaagcctg gtggcaccca gagtgggtac tgttcggtaa agagctcacc ctctcacagc 
accaccagcg gcgagacaga ccccaccacc atcttccctg caaggagtgt ggcaaagtct 
tcttcaagat caaaagccga aatgcacaca tgaaaactta cangcagcan gaggaacaac 
agangcaaaa aggcttaaaa aggcggtttt tcagctgaaa tggcaccnnc aattganagg 
actacngggc cccgtggggg 



720 

765 



<210> 637 

<211> 853 

<212> DNA 

<213> Homo sapiens 

<400> 637 

ttttggancc nttctttgan nctttctaat gctgggntac tcgntctctc tgcaggntcc 
catcgattcg aattcggcnc gaggatcagc ccacctcggc ctcncaaagt gctgggatta 
caggcgtgag ccaccttgcc cagcccacat catacagttt gaaatgaaac tttgccacaa 
ccagcctttg ctgtagcaca cacatatatc actgaacctg tttgaaataa agtttttttt 
ctttntcctc tggtattctg ggttctgaag tctggtattc tggtattctg ggttcaaaag 
tatgacttga gagtgttgct ctggtattct gagagttgct ctgtattctg ggttctgaag 
attatttgaa aaataactcc tactacattg aaatgcagac ttaaaaattt aaacattgga 420 
ttangcagtc aaaaaaacca agcaagcata aaaggtcaat aagttgtaat cttgatagta 480 
aaggtggaaa acttattata aatggnaang aaagttttat ttcctttttt gtttgaatgg 
gcaagtatgc catattatac ccaaaagttc ttttaaaaaa atatttccca ttcaacccat 
ttttaattna aaattaaaac cattttgnaa gggaaanttt acccaanggc aanccttttt 
tttcctccaa aaaggttnac cntgttnatc cttctttttn ggnaaattta nccaccaatt 
tttttaaagg ngggncaatg gggnttaaaa ntanccctgn aagnnatttt ttnanccttc 
caggtttaaa antccccttg gatngggtct taacctgggn gggtngnata naaaaaaata 
natcctnttt anc 

<210> 638 
<211> 740 
<212> DNA 
<213> Homo sapiens 
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<210> 639 
<211> 774 
<212> DNA 
<213> Homo sapiens 
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<400> 639 

ttttnnctnt taatcaatcc tttgttgact ccttggctac ttgttctttt tgcaggatcc 60 

catcgatncn aattcggcac gangtgatgn cagattgnna ntncactaaa ctgggcannn 120 

catcaggatc acctgtgggc cttcannaat cananatnca cccccaggcc atgccctnga 180 

cccagtgcac caggacaaga aatccacccc aggcctctcc cnagacccac tgnaccagna 240 

caagaaatcc acccccangc cangccccnt acncactgcc ctangatntn nnggtgtnaa 300 

ccnggtggtg ctttgtaaag acgtgcangt ggtaacccca cgccgncncn ctcnnnacnt 360 

tggacacatg atcatccacg tgtctgtgat ttgnttcctc ggnttnnttt gtgaatngaa 42 0 

aataantgtn ncgtttgact agggtttaag agcagcaggc agnccctcag ctcagcaagc 480 

ngccctctca gctcagcang cagcccaagt ctcctgtang acttctatgg accatnctgg 540 

cgggaatgaa gaaactggtc aagctggatt cgggactgaa agtgtaccnt ggtgacaccg 600 

tatgactnan ctgactnana aagatcactn atctttccac acttgnnggg naggagccnn 660 

tannangttc aatatgcnnt ggtngantcc catngctaca atttcatgga cacantttga 720 

ttacttnnga taannnaggc ccttggaggc cccttntccc cttttaacng gaat 774 

<210> 640 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 640 

ctnnncctcc ttgatccntt cctnctttga anncatnngc tacttgttct ttttgcagga 60 

tcccatcgat tcgaattcgg cacgaggctg acctacatca gaagctgctg gatgcagnaa 120 

agtgaaaaca gaccaaaaca acacngggcg aatcttnaca ccattntggg tgccnnatnt 180 

nnccnnngat atttgcttgc tnagctctac tcctccaaga nannangrxnt caaacnctnc 240 

agcangntag agcanntnaa gaccgcntnt nctnacctnc tnaagannct ctgngaggan 300 

cgcaatcctt tngtggaana tagaatcaac agaccacact gcnctctgga ccatgngctc 360 

tcaaangngc tagaaggtgc tgaccttttn agactcttgc agaagaggcg angtggtgng 420 

anaccctnna ggaanacttt cccgaactag accncnnctt ncngaacnng ntcaactgtt 480 

ggggnngaaa ncntgtgann tgtngncctt cngagagacg gcatattcta tgatggcnga 540 

cttnatnctt ctgcggaacc anactngacn tactgaaaga aanctganac caagcgtctt 600 

ccttaaggac ccttatatcc agacnatcct ttggataata ccnctnggcc aaaacctnnt 660 

aactntgcat acaatcngga tggcaacatt tgaactggng gccttnanna ccnttaccgg 720 

cttttcncat tatgnaagag ntn 743 

<210> 641 
<211> 740 
<212> DNA 
<213> Homo sapiens 



60 



<400> 641 

ctttcctttg antcttcttc tannaaacgt tngaacgaan tcngcacgag accactaaca 

gcatctactt gactactgat actttgatca tggagtttgg gcatgccact tgatagaaat 120 

ttgaagagca attatatttt tcaaaaagag ttttgaataa tgttaagata gattgcaaca 180 

tgactatcaa ttcttccctt cccatcaaag gagagagtcc gtttatccag cctttgaatc 240 

ttgattattc aagtgacttg cttcacccaa tgtaacatta ataagcacaa tacaagcaga 300 

ggcttgccaa gaacttggtt tgtttctaat gcttagaaga agaatggtgt atgccatatt 360 

tctgcattta gaactcacgt ggagacatgt gtggcccaat tgctcctctt tcatctcagg 420 

caataaccag acacgggact gaggccatcc atgaccagcc agccctagtc aacacacaac 480 

acacaagctg atcacagatg catgagtaag cctaactgag accagccaag accagcctag 540 
aatagaactg ctcagcagca ataaaaacta aataaattgt taccttaagc tacttttaga 
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gctatttgga agtgtatttt tgtgcagcta acatttacta tcagataaaa tggtgattgn 660 
ttatctctgn tttaatgatg ntttaaggaa atggttctat taaaaggaaa tatctggggc 720 
tttgtcaccg ttaaaaaaat 



740 



<210> 642 

<211> 737 

<212> DNA 

<213> Homo sapiens 

<400> 642 

tancctttga nnctttctcn ncntgncntn nnngnaacga cctcggcacg aggacacccc 
agatgcagcc accaccagca gaagcgatca nctgacccca caaggttttc gtggctgtgg 
ccgtgggctc aggtggcagc tatggagccg aggatgaggt ggaggaggag agtgacaagg 
ccgcgctcct gcaggagcag cagcagcagc agcagccggg attctggacc ttcagctact 240 
atcagagctt ctttgacgtg gacacctcac aggtcctgga ccggatcaaa ggctcactgc ™ n 
tgccccggcc tggccacaac tttgtgcggc accatctgcg gaatcggccg gatctgtatg 
gccccttctg gatctgtgcc acgttggcct ttgtcctggc cgtcactggc aacctgacgc 
tggtgctggc ccagaggagg gacccctcca tccactacag cccccagttc cacaaggtga 480 
ccgtggcagg catcagcatc tactgctatg cgtggctggt gcccctggcc ctgtggggct 540 
tctgcggtgg cgcaagggtg ttcaggagcg catggggccc tacaccttcc tggagactgt c ™ 
gtgcatctac ngntacttcc tctttgcttc atccccatgg tggtcctgtg gctcatccct 
gtgccttggc ttgaatggct ttttggggcc tggrxcctggg ctgttaaacc gccgggctgg 
natttaacct ntnggcn 

<210> 643 
<211> 748 
<212> DNA 
<213> Homo sapiens 
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<400> 643 

cttttaaccn tttganccnt ccctcnaaac cttngatncg anttcggcac gaggaaggca 
gaagtgtaaa tgaacataca ntttaaggag aaagcctgct gtgtttnnct tgttcagcag 
ggtattatga attagcacaa gtattgcttg ctatgcatgc taatgttgaa gatcgaggga 
ataaaggaga cataactccc ctgatggcag cttccagtgg aggttactta gatattgtga 
aattattact tcttcatgat gctgatgtca actcccagtc tgcaacagga aacactgcgc 300 
taacttatgc atgtgctgga ggatttgtat gacattgtta aagtgctcct taatgaaggt 
gcaaatatag aagatcataa tgaaaatgga catactccct taatggaagc agccagngca 
ggtcatgtgg aagttgcaag agttctttta gatcatggng caagcatcan cactcattct 
aatgaattca aagaaangtg ctctaacact ngcttgctac aaangccatt tggatatggg 
gcgctttcta cntgaagctg gtgcagatca agagcncaaa acagatgana tgcacactgc 
cttaatggan gcctgcatgg atnggacatg tanaggtggc acgtttgctt tttggatant 
nggtgctcan gtgaacatgc ctgcataatc atnttgaatc tccattgacg ctagctgcct 
gtgganggac atgttgaaat tgcngcct 
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<210> 644 

<211> 759 

<212> DNA 

<213> Homo sapiens 

<400> 644 

tcnnncnctt ttcgatcttt tgagncttgc ctttgaaccc cttggntacg anttcggcac 60 
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gagggaacca tgananccna gagctagaat tgctattgga tnncgtctat tctctntttg 120 

cttattgggn cgngntncgt ggttnctggc ctcangggtn nncccgaang anggggtatc 180 

tnngagcnan ttntgcnntt tacnggctag cttgntgggg gcttaanntg ccactnttan 240 

acatgctnta ctantcantg agannntncn ntcgaccatn tannacnatn ctgtgnnntc 300 

cngtacnctn tggccgnatg gagctattag cttcaanatg nntcgnantg ttacatgcan 360 

ncactgannt nactatccan natntaagtn ctcttngctt actgtgaaca nnngctactn 420 

ncttggatat tatagnaagg ntcnttgata cncgatnatc ntncntgtca gatcnataaa 480 

tancanctat accnactgtn naaatnccat ctggnggnct tncnatccan acataattgc 540 

attannncgt cnaattgnga tanagtnttg aaagantctn ggtttagacn ttggatgttg 600 

caatgnttgt gncttanaan ttatgtgctg gctactgant aanctggggg catgacntta 660 

ctggnttgac ctaagnggng aantcnatgg tccgattgct ggnccctanc cttaagnttt 720 

gccatgaata ggncttttgc cctaaaataa naccccttt 759 

<210> 645 
<211> 766 
<212> DNA 
<213> Homo sapiens 

<400> 645 

tnnnnnnntt tcaatntttn ancgtccctt aggatccntc gattcgatcc agatgggata 60 

cctctaaaca cgaaaagaaa gaagattcca ttantgaatt tttaagtttg gtttnatcaa 120 

aagccgagcc acctangcaa cagtccaccc ccttagtaaa caaagaggaa nagcatgcac 180 

cagaatcatc cgcaaatnag acagtcaaca aagatgtgga cgcacaggct gaangagaag 240 

gganccgcca tccatggact tattcatggc catctttgcc agttcctcat atgaaaagtc 300 

ctnatcctgc gangatganc acggtgacag tnaanatgat caggcacgct ctggngagga 360 

caacttccaa agctggnaag acactgactt ggnggaaaca tcatctgtgg ctcacgctnt 420 

tgtgccagng ccctaggagc cgtcaccttc cttcccgata caaangatgc agatagatna 480 

naganaagag ntcggccngn ngctgcctcc cgtcttatgt nccaatgctc gtcagacact 54 0 

tgaagttnct canaaagaga aacattccaa gaacaaagac nagcacaang gcaatanaga 600 

acacaggccn gaaagaattg anangaaatt ggaaacactn gaagcacnaa acacctaang 660 

naatccaaaa naattggcaa accaggggaa aagtaggtnc ctncgngaag tttcgacagc 720 
cngcggacaa gccanaattg acnatgaaac cgcatacgtg tcttnc 



<210> 646 
<21l> 752 
<212> DNA 
<213> Homo sapiens 



766 



<400> 646 

ttnnnnnntt tttatcctnt natncttnct ctttggatcc atcgattcgc tccaaggaaa 60 

atccacctcg cagcttgtaa atctacagcc tgattacatc aaccccagag ccgtgcagct 120 

gggctccctt ctcgtccgcg gcctcaccac tctggtttta gtcaacagcg catgtggctt 180 

cccctggaag acgagtgatt tcatgccctg gaatgtattt gacgggaagc tttttcatca 240 

gaagtacttg caatctgaaa agggttatgc tgtggaggtt cttttagaac aaaatagatc 300 

tcggctcacc aaattccaca acctgaaggc agtcgtctgc aaggcctgca tgaaggagaa 360 

cagacgcatc actggccgag cccactgggg ctcacaccac gcagggaggt ggggaagaca 420 

gggctccagc taccacagga cgggctctgg gtatagccgt tccagtcagg gacagccgtg 480 

gagagaccag ggaccaggaa gcagacagta tgagcatgac cagtggagaa ggtactagtc 540 

aaccttcaga aagagtatgg agagaaaaag aggcacacct ggacgcagag ccctgccagc 600 

gccctctctg ctgttgcagc tgcaaggaga ccatgcctgt gggagccagg cctcgcttgc 660 

atgaanaagg aacgatgcct ttttcaatgg tgtcttcctt ccattgtgca naanaacctt 720 
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ttggtggctt ctcttccgac ttgtgcctga tt 



752 



60 
120 
180 
240 
300 
360 



<210> 647 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 647 

ttaatccttt caattcgttc ntctttggat ccatcgattc gaattcggca cgagccctcc 
ccggcttccc ccggagtggg tcaccacact gttttttatc atcatgggaa tcatttcatt 
gactgtcaca tgtggtttgc tggtggcttc ccactggcga agagaagcta caaaatatgc 
tcgatggata gcattcactg gaaccactat gagaagatta taggaaaaac accaagacta 
gaggactctg ggttcctttt atgcaaagtc aactcttctg ggtcacagtt acccagcaac 
aaaaataaag agaggaccag gacgatgcca gcaccccgtt tatcctgagt gaactctccg 
gaggcctctt caagcttgtg ggttctctgc tgtcttgaag ccatccatcc atttgatagg 420 
ttttgcaaag acttggtcct gccaagatgg ttttaatcat ttctgctaaa aggaatggac 480 
tcgaggattt gatctcattt tagatgcagt tgtcctcact tggccatttt acagcacttt 540 
agtaaatatg gccagtgtat ttggtcacta ttaaatcaat ccccattcat tatctgtcan 600 
ggcaactcag tgaactaaat actatgttct gacctctggc actctttctc atgttggtta 660 
aatatttaat attgnctaag gcaattcaag tatttttctt aaataaaaaa tatgaaaact 720 

743 

caaaaaaaaa aaaaaaaaan ana 

<210> 648 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 648 

ttttaatccc tttcatttcn ttccttngta ggatcccatc gattcgtttt tttttttttt 60 
ggtgattgga ttaacaattt tattctgnnt ccactacaaa ngggctggtg ttttgttcca 120 
aatgtttagc tgggagggct gtagggaccc ctgttacccc cattaaacac agtaaagcat 1Qrt 
ggatccagtc agccccctgc tggcaggtgt gggcctggca actacacaga tccaacccca 
ccctcctggg tgcggccaga ggccaaggca gtcgcccgag ctcctgaatc ccaagaatgg 
ttctggcaag tactgctgtt tgtttgtagg ggcaaagagt taaaataaaa cgaggttctg 
ccatggctaa gccttgtgga aaccagaccc caaagcccct gccatgccan gggtctcaac 
nccagacgct tgttatggag gcaccancng gtantggccc ctgtaagcan ggccagagtc 
gggacaaaga gcaagantga aacanccaag agacanagga ccatgctgga ccattgggca 540 
cncangaacc tgcctgggaa aaaccggggg gcaangctgg catgggaatg aacacctgct 600 
tgntgacacc tatntgagct tcanttncct taacttgaaa aattgaacan gcccggtncg " n 
gtggctcata ccctgtaatc ccancacttt tgggangctt tangccgntt ggatcattga 
ngttaggaag attaaagaac cancctgggc cnacattgg 

<210> 649 
<211> 746 
<212> DNA 
<213> Homo sapiens 



180 
240 
300 
360 
420 
480 



660 
720 
759 



<400> 649 

tnancctttg aatccttgaa ngnngatccc tcgattcgcc ggaacctcat ccagtgccac 
ccatcttgac accttctccc tcttcagctt ttccaacagt cactactgtg tggcaggaca 
atgatagata ccatccaaag ccagtgttgc atatggtttc atcagaacaa cattcagcag 180 



60 
120 
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acctcaacag aaactatagt aaatcaacag aacttccagg gaaaaatgaa tcaacaattg 240 

aacagataga taaaaaattg gaacgaaatt taagttttga gattaagaag gtccctctcc 300 

aagagggacc aaaaagtttt gatgggaaca cacttttgaa taggggacat gcaattaaaa 360 

ttaaatctgc ttcaccttgt atagctgata aaatctctaa gccacaggaa ttaagttcag 420 

atctaaatgt cggtgatact tcccagaatt cttgtgtgga ctgcagtgta acacaatcaa 480 

acaaagtttc agttactcca ccagaagaat cccagaattc agacacacct tcaaggccag 540 

accgcttgcc tcttgatgag aaaggacatg taacgtggca tttcatggac ctgaaaatcc 600 

atacccatac ctgatttatc tgaangcaat tcctcagatt tcaactatca aaactaggga 660 

aaactgngag tttaacacca agtnctacaa cacaaggttg gaaacacctg aacttggngg 720 

atcatgatac cacttnacca ctccnt 746 

<210> 650 
<211> 789 
<212> DNA 

<213> Homo sapiens 
<400> 650 

tgaccctttt gaaantcctt gcatntttca naccntttgg tacnnncant ttnngntgga 60 

tccctcgttc gctgnacaaa agatgttttt caattaaaag acttggagaa nnttgctccc 120 

aaagagaaan gcattactgn tgtgtcagtn aaaggaancc ttcaaagctt tattngatga 180 

tgggtttggt tggactgtga gaggatcgga acttctaatt attattgggc ttttccaagt 24 0 

naagctcttc atgcaaggga aacataagtt ggaggttctg gaatctcagt tgtctgaagg 300 

gaagtcaaaa gcatgcaagc ctacagaaaa gcattgagaa agctaaaatt ggcccgatgt 360 

gaaacggaag agcgaaccag gctagcaaaa gagctttctt cacttcgaga ccaaagggaa 420 

cagctaaagg cagaaagtag anaaatacaa agactgtgat ccgcaagttg tggaagaaat 480 

ccccaagcaa attaagtagc caaaagaagc tgctaacagg atggactgat taccatattc 540 

gcaataaaat cttgggccaa aagaaaattt gggttttgaa agaaaataaa aattgatngg 600 

aactttttgg aattccagaa gactttgact acatagactt aaaatattcc atggttggtg 660 

aaaggatgta ccaagctttg tgaaatattg taaattttta aacctattat ctactaaagt 72 0 

ngtactggaa ttgtccnttt gcctgttnac ttgnggtnta ntcatttnta tttaatgntn 780 
aaattaang 



<210> 651 
<211> 757 
<212> DNA 
<213> Homo sapiens 



789 



<400> 651 

tnnnnnctaa ncctttgaaa tcgtccntgc atgatccctc gattcgaatt cggcacgagc 60 

agatatttac tgaaggaatc taggttgttt tttcagtgga caatgggaat aanncatttc 120 

taaagcaccg actggagagg aaggcaacag agacaaggag agaagccgag agacatgtct 180 

gcgtgctgcc acgcatctga gcgattgctc tgtgaagagt tgtacactga acattttcag 240 

gggaggctgt ttacccaggc aatgtcctca aacaagcctg tgccggggtg tcctggaatc 300 

tgtgccagga ctgtgttttt agcccttcac ctctcagctt tagcaggaca tgaaccagtt 360 

ataacaagat ggccctgcag ctggttacag gaatgtgaca tggcaggatc tatggaacca 420 

aatggaaggt tttnaggtga tgtaggtctt tcacagttag ctttggggaa tacagaatac 480 

tcaaataaag tgctttgtta ttatttcaga gggaatggcg attgaaatgt tacaacagag 540 

atttcttggt ggtagctatt tgggtaaang tatatggata tttntctgta catgtgaaat 600 

tatntaaaat aaaagttata taaattacat tgacaaaaaa aanangtana aaaaaaactc 660 

gaacctttaa aaactatngt ggagtccgta ttacgttaga tccagacctt gataaganac 720 

cattgatgaa ttttggacaa accccactng aatgcnn 757 
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<210> 652 
<211> 759 
<212> DNA 
<213> Homo sapiens 



<400> 652 

tcnnnccttt aatgctttga actcgttgca ctgcangatc catcgattcg aattcggcac 
gaggctgncc aggcagttnn atggcctnct ggttgtgtgc cttcacaccc gcctacagcc 
ccacctcacc atcaagcgct gagccaatgc ggntgtggct ggccctgagt tcctgagtca 
gctccttgcc agggccagag ctggtnacag cggggcanca nggtgggtag cctctaccag 
ncagggcagt ccctgagggg ccagcanggg ggctgactgc ctagtggctn aacctactga 
acccacccac tcccagcgat gctacccaga accccaacgg cntgaatcct gcacantgcc 
gggcantgcc agactcnaaa gggctcgctg tggggacagc cccgtcatgg ccacanactc 
tgtcctcacc tttgattgtc aggatgacag nccccaccac catgatgagc gtctgcaggg 
cgtccgtgta gattacagca gccaggcccc ctgccaatgg aagcaaggtt gctggaaggg 
gccctggtcc agggaggaag gacaccggga ggaacttctg ggcttctgct ggggccactt 
cctgggctgn tnctcnggnc tgtatgggga agtggccttn tgaccccttt acacgttccc 
tgggtggacc ttccctgntt gcangcaccc ataccttgcg atggtgtnca nggctnttga 
tgcccnaaac tttaggattg ttggatangt nnaatctnc 



<210> 653 
<211> 820 
<212> DNA 
<213> Homo sapiens 



<400> 653 

tgcaatcccn cngnnaatcg ctttgaaanc nccntcnctg tatgatccca tcgattcgca 
acagtccagg ctctgcagac agcatcccac ctgtcccagn tngctgacct gaggagcatc 
gtggnggaga ttgaggacct tgtngctcgc ctggatgaac tcgngggcnt gtatctccag 
ncanaanaan gacngcatac aacagaccat tangangntg tcatctacan tntnanngat 
catntgngna cngacccatc cattaatgag gatcanggcn tccanctgat gaacgctgat 
cttctgcaan aagaacgttc tagntctanc nnanngccnt cancctncgn ctcttgagct 
cagtgngtca ngctcntaan atcttnncac ntgccaanct gtgnggnctg ccttnagnct 
tccggatagg cactntnatn ngacntgccc tatanttgcc ngcngnnant naaccaantg 
naccatngtc actctgttga catcanggcn atntgnntaa actaatnnct tngcngcact 
ctagtnngcg ttgncactgc ccncgtnnnc tancntacca nttcncattn cccntttaat 
gggnaaagan atnatcccta cnatcatatt ncccntnnaa tggattcgag ncgnaantct 
tnnntantna tctnaancct aaatgntcac atnnaaactt tanangncat cnnnnatgna 
accnancnat ggctaaangg cctcattaan gccngntttt tcaaacttga aaantgcatn 
ccnccattga naaagganta cacgggcccc cntgngnggg 



<210> 654 
<211> 768 
<212> DNA 
<213> Homo sapiens 



<400> 654 

tttnnccccn ttttgtncct nttgattcnc ttgctacntn ttcaaatcng tnggatccca 
tcgattcgcc acatttaagt gagatatggg aaggaggagc agattgtttt tgaagggagg 
aagagcagtt acttagggtc aaattaagtt gtaaaatccc ccccgggatt ttgtatgtaa 
gtcaaagtga attgtatttg gaagaagaac tggggagccc acctctggta ttttttttat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
759 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
820 



60 
120 
180 
240 
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<210> 656 
<211> 754 
<212> DNA 
<213> Homo sapiens 



<400> 656 

ttttcctttt natcttgctc nanaanccnt ggatccctcg attcgcagag gctggttcag 
aaaaggagga agaggcccgg ctggcagccc tggaagagca gangatggag gggaagaagc 
ccagggtgat ggcaggcacc ttgaagctgg aggataagca gcggctggcc cangaggagg 
agagtgaggc caagcgcctg gccattatga tgatgaagaa gcgggagaag tacctgtacc 
agaagatcat gtttggcaan aggcgaaaaa tccgagaggc caacaagctg gcngagaagc 
ggaaagccca cgatgaggcg gtgaggtctg agaagaaggc caagaaggca aggccggagt 
gagtgcctgc ggcccctcac agggctgang ccagccccta tcagctggat gtggcagagg 
catgccanag gacctaagtg tgatggacca gantcacttc tnctcctcct ttctncacca 
gccctgaccc ctcatgctct ctggctgggc cantgggcaa ccctcgcttc cttggatgga 
ctgcctgctg gtgcctggtc agagaanagc ctnttttccc agnctgattc tntgctccca 
ggaaccaatt gaccatnaag gtgcaaangc cnanccaatc cccttacnta ctggccccca 
ttnattcctg gctttttcan aagcccccnt gccaaacann ttgggacccc ctgattnntt 
aagggtgcct tttnatnggg gttaaaggtt aant 



<210> 657 



197 



300 
360 
420 
480 
540 
600 
660 
720 
768 



gtccctcata tggacaaata aacctctggt attaaatgaa ttttcttttg ggggattcta 
tatattcggg atttcaacca ccaacctatc tggtttttcc cgctgaaatg ttgggtgatg 
gaatcaggag agcagatttg gagactcttt atattttata attgagagag acaaagagaa 
aaccgtttga tttgaaaaag ttttctaggt tccctcaggt agatggaaat tttcatcaaa 
aacagtttat tcaaggtaca tagcctacta gtttcccatt tgagagtacc gcagaatgat 
acgacgtgta ctgcttctct acgcagaatg aagtataaaa ttagcaccna atagtacttt 
aatttgcagg tgctaaactt tttacatgct tnatctcatt taattcttag aagaaactaa 
ttttaccaag taaantgtct ggaccaacca tntgcaggtc caaaannctg gaaaaaccgt 
naggtttgga ctcctacata gcctnttttn taagtnncnt nntaaatn 

<210> 655 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 655 

tntncctntt gatccttgca ctannaaatc cgtggatccc tcgattcgaa ttccracacaa *n 

gggcaaaccn atttatataa tagaaggatg attataaaca tttaataaat tatatcaaat 120 

agatattata tattaaatgg gcagataata gaaatctgtc caagcaaaac tctggataat 180 

ttttatgttg ccttattttt tgttttctgt gaactccaag aaaaatgaga taccagtttg 24 0 

gaacagatgt aatattgctg atttaacagt ttagggatac tccccaagtt caataatttt 300 

ff*"*!^ aaatttaaat ggaacctttt atgaagcttc atagtgtgtg aagaacttac 360 

cttgtttata tgtttgaaga catacatatt tcacatttca gaagagtcta tacatagctc 42 0 

accaaatatc aaaaccacct tgttagaaaa cattaaggtc tgtcttattt atttgttcat 480 

ttgnttatga gacacantct cactctgtaa tctcactctg ttgtagaggt tgagtgcagt 54 0 

ggcacgatca cggctcactg caacctncat ctccctgact caaggaatcc ttccacctca 600 

gccttccaag tagcanggac caccaggtgc accccactat gcccagctta attttttgna 660 
ttttattgga cagattgggg ttttgcccat gttattcagg ctggatcctt nnggcctcaa 
actcctgggg cttcaagcca atctggcctg cc 



720 
752 
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<211> 734 
<212> DNA 
<213> Homo sapiens 

<400> 657 

tntgttccnc natgaacgnt ngaancnnna tnccnttgga tcccatcgat tcgctgcggc 60 

cgcaggagct gtggcggttt tcctaatcct gcgnttatgg gtagtgcttc nttccatgga 12 0 

cgttacgccc cgggagtctc tcagtatctt ggtagtggct gggtccggtg ggcataccac 180 

tgagatcctg aggctgcttg ggagcttgtc caatgcctac tcacctagac attatgtcat 240 

tgctgacact gatgaaatga ntgccantna aatnaantcn tnngaactan ancgagctga 300 

ttganaccct agtaacatgt ataccaaata ctacattcac cgaattccaa gaagccggga 360 

ggttcagcag tcctggncct ncaccgnttt caccaccttg cactccatgt ggctctcctt 420 

tnccctaatt cacagggnga agccngattt ggtgatgngt tacngaccac gaacatgtgt 480 

tcctatctgn gtatctgncc ttatccantg ggatactagg aataaagaaa gtgatcattg 540 

ntactttcaa agcatctgcc gggttgaaac gatntncatg tcccnaaaga tttgttgatn 600 

tgcagctnct cantgctann gtcggttttg aanaaagttt nccaaatnnn tgtaccttgg 660 

gccaatttnt ngacaantng aactgacttg tnagaatctt gcagntaacn gtcttgtntc 720 

734 

ntccaattng ggng 

<210> 658 
<211> 783 
<212> DNA 
<213> Homo sapiens 

<400> 658 

ttctcctgaa acgcttngca cttccctcnc tgcaggatcc catcgattcg aattcggcac 60 

gagacactgt cccactccat cacccaggct ggagtccagt ggtgtgatca tagctcgctg 120 

catcctccag ttcctgggtt caagccatcc ctcctgcctc agcctcccca gtagctggaa 180 

ctacaggtgt gtgccatcac acctggcttt acatttttct gtggggtctt actatgttgc 24 0 

ccaggccggt ctcaaactcc tgagctcaag tgatcctctg nctcagcctc cagagtatct 300 

gggattacat atgtcggcta ccgtgtctgg ccgttcacat ctttggccac tattngcttg 360 

tgaaaaggta tnatgaggtg gtacttatca tngttactgt gtctcatgtt nngtatattt 420 

ttgcttcatc aactaagatg cactgtaaca tctgtgaaat ctggatatat tatcaaangg 480 

tttatcatag ttttgttaac aatacactgt cgttttactn ggtgcctaan ataatggtat 54 0 

agttgngagg tgatcttaga tttgatgaag cacagtatgc aangtaggcc taatggnggg 600 

aaagaatggg naattttcan angcnnggaa gtatttgntn ttttgtaaat ggacttgaaa 660 

agcttgttct gnnggattgg acccaacccc tttccctttn aaaccccgaa ttctnatnga 720 

ctnttccaac ttngaaaact ttgctcnaac ttaaatacct ttnaaaaatt aaccntgacc 780 



783 



<210> 659 
<211> 741 
<212> DNA 
<213> Homo sapiens 

<400> 659 

tcttcctttg tatacctgct nttgctcttt ntgcaggatc cctcgattcg ctttgagcta 60 

ggataaaaat tgggtaaagg acatttgctt acctgcaaat gaatcactgt ggaaatgtga 120 

tcttcccata tcatcaagaa acttgttttc tggatgaata ctgggagaat aaaatgagaa 180 

ctctggagtg agctaaattg atcccaatta agtttttctg cttagcagac agaaggtata 240 
attttttgac accctttccc acctggtgcc tatgctaggc ttgtcctgag aacatccctc 



300 
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agtaacttga tattcacatg acctacagga tgtcccatct gcagggctga gtcagttggg 360 

gaacaccaga ggctacacag tagctcttcc tgctactcgg ttaatgagct tggcaggttc 42 0 

tttgtctcac tgaattctta tcatggaaac agcagcagca gccgctagga aatcttcaag 480 

tgtagtgtct gtgctaaccc agtggtaaat cccttagatc ccctgctggt ctctggcagt 540 

ctccttgatt ttgggtacca tgtatatttt ccgctttgac tttaacgctt tctaggatag 600 

ggtaagcacc cttaattcan gcactgtcca ttagcttcct ttgcaaaagc tacttatggn 660 

cggtcacaat ncaacactna nacagagcca aggcaatatc ctcttgccca tggctatgat 720 

gtcagacagt ggatggctcn t 741 

<210> 660 
<211> 734 
<212> DNA 

<213> Homo sapiens 
<400> 660 

tctgnnctnt gtntccttgc tcgtgttctt ttgcaggatc cctcgattcg aattcggcac 60 

gaggactgga gaagtcagaa gtagaaaagc agattgctag gagagacagg atgacagatt 120 

ttyyfcuagaa aacgggatac tggagtttaa agtatcaaat acagaatagt tccagatgtt 180 

cagagatcca gcatgggatt aggtactgaa atggattaga actaaaagtc actagaattt 24 0 

agaaattgag aaccatgaga gtggatgcaa tgacttgttg cttgattgaa aaataaatta 300 

ataataataa aggaccatga gactagcctg ttataggggt tatctccatg aacattgaat 360 

tttcccagga tcatagcagg aattgggtag agaaaaagat tatgagaagg tgccagagtc 420 

ttcagtgaat gtcaggaaat taccaggaag tcagcatatg acagagaaaa ggacagtatg 480 

ttatctgcat caaaggaaaa tgtgcttttg ttgaaaagta cagaaaaagc caatactaca 540 

atactgtgct aagcccctac ctgtactcct ctcccacagc tgcattccag ccctgtggta 600 

taaaaggtgt gagaatgagc ttttccacca gaatcagcag gtttagttaa agcatgagca 660 

gaacaagcat nctatgaaga gactgaggat gtaggtgagt ggtctaaatc tcatnnaagg 72 0 

acattgcagt ngat 734 

<210> 661 
<211> 762 
<212> DNA 
<213> Homo sapiens 

<400> 661 

ttnnnnnnct ccnaatcctc cngatnanat cnctttgnan ctncctgcag gatcccatcg 60 

attcgaattc ggcacgaggt ccatacatgg agctccctgg agcccgtgtg ntntcgtgtg 120 

actgaacgtt ttgtgatgaa aggaggagag gctgtctgcc tttatgagga gccagtgtct 180 

gaattgctga ggagatgtgg gaattgcaca cgggaaagct gtgtggtttc cttttacctt 240 

tcagctgacc atgaactcct gagcccgacc aactaccact tcctgtcctc accgaaggan 300 

gccntngggc tctgcaaggc gcanatcact gccatcatct ntcagcaagg ngacntatat 360 

gtnnntgacc tgnagacctc agctgacnct nccttngtan ggttngatnt nggaagcatc 420 

ccaaggngat ttagngacnn tggantcctn atnactgata anacncnaac tatantnttt 480 

tacccttggn agcccaccag caagaatgag ttggagcaat cttttcatgt gacctnctta 540 

acanatatac tctgaatgaa tctacgttgt atttatcagg nggacaatgg gaataaagcn 600 

tttntaaagc accnantgga catgaaagca acagacacna ggagnnaagc cttgagacat 660 

gtctgnnntc tgaccgcatn ttgatccant gntctgtgan ganttnttca ctgaacattt 720 

tcaagaggag ggtgnatacc cctggcaatn gcccnaanaa ag 762 

<210> 662 
<211> 745 
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<212> DNA 

<213> Homo sapiens 
<400> 662 

nanatccnnc nantncttnt tgttcntgtc cgnangatcc catcgattcg aattcggcac 60 

gaggtttcat ttaagaagaa tganctagat anatgtgctc ttctggttac cccaccctga 120 

cagagtgcat ttttacacgg ctagcagggg ttgagactgc agcctggcct gccagccatt 180 

ggaggtgttt aaggaagggc agataatgtg actctttgcg gggtgccatc tgcttaccca 240 

ttagcgagca nagggggttt ctgcgggtga cccccagcat atttctaggt tacttatggg 300 

cagatttgta agtgacaaaa ctccagctga tgctgggaat ggggagaggg cccttgaggg 360 

actttgtggt tttgtgcttc tggtttcctg gccaacccca gggtcacttg tctggaggcc 420 

cagctgggca ctaatgtctg ccaccgacta tgttaaagtg tataaatgat tcctctattt 480 

gggagagatc ttccaatcca gaggagcccn tcttggactg cctgggttaa atctgcatan 540 

cagangtggt tgatgaagtt catctgaaga aattcagccc cacctnccca ccctgccntt 600 

cctgctccct tttgatagtg gcttctgggt actcgggcnn gtncttggga caccancctt 660 

ntctgggggt ctnaagccat cccgttgggg ctgtcggcca agcctaagtt aatcgtgtgc 720 

ctntattggg aggatngctn ntcct 745 

<210> 663 
<211> 748 
<212> DNA 
<213> Homo sapiens 

<400> 663 

taatcctntt gataanaatc cttgtncttg ctnntgancc ntcgattcga attcggcacg 60 

agggcaagtt tccaaagatc agtgtggagt gctacagaaa taattatagg agaggaaatc 120 

ataatcacag aaggtataat gcttgtttga ggctccggaa taagaactaa aaaaaaacaa 180 

aaaacactgg tttcatgctt acggggtaca cactttggtg catcccgtga acacaaattt 240 

taataccaaa caatccttga tgcttcacct ggggctgcca agcagtttgt aaaacagagg 300 

aaaacattta gtgcagtctg tattatcctt ttccaacttt tctgtttgtg caagtttttg 360 

aagattcatt ggccaaacaa tgaacaacaa aggttttctg agagaagaca aggtggactt 420 

ttcattttgt tagtaaatac cagtggcact gttgaacgaa acaaatactt ttatctcagt 480 

ctttcaaatc agtattaatg tctgtgtttc cttccactga cagctcttct tctagtttca 540 

ctgaaaaaag ggtgttagta tttttatctt ggacactctc ttccaaatcc ttcagcagct 600 

cctcttcttt atattctgcc acatcgacct ctaaaccgga attgtccttc agtttgccgt 660 

ggtgcttgag atantacccg ctggttctga aagaacttga tgatggtgta ctttgggaag 720 

gtcnaactgg gcanacagag tctggatt 748 

<210> 664 
<211> 785 
<212> DNA 
<213> Homo sapiens 

<400> 664 

gtnnnnccnc nnaccctnnt gaatntaatc cttgttcttg ctgcatgatc ccatcgattc 60 

ggtcaagctg gccctggatg tggagatcgc cacctnccgc aagctgctgt agggcnagga 120 

gtgcaggctg aatggcgaag gcatatggac aagtcaacat cnntgnagng cagtccaccg 180 

ncttcagtgg ctatggcgnt gccagcgntg taggcagcng cttaggcctg ggngnnggaa 240 

gcagntactc ctatggcant ngncttgncn ttggatgcng cnntagtncc agcagcggna 300 

nagccactgg gggtggcctn agctctgtng gaggcggcag ttccaccatc aagtacacca 360 

ccacctcctt ctccagcatg aagagctaca ngcactgaan tgctgccgcc agctctnagt 420 
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cccacagctt tcaggcccct ctctggcagc atagccctct cctnangttg cttgtcctnc 480 

cctgncctcc antctcccct gccctaccgn gnagagctgg gatgccctca ctttntnctc 54 0 

atnaatacct gtttcactga actcctgttg cttaccatca tgtcncagtt atcagcactn 600 

aaancatgct aatgnccttt tataagnccc ngtatttatt acaagnatct tgaantctgc 660 

cattaaattc ttgaggaang aaaatgacct attatccccc ataaagaacc tgaaacttca 720 

agnctaangt cccagcntnc aacanggaag gagntccntt tttttnattn gctaaaccan 780 

tcctc 785 

<210> 665 
<211> 763 
<212> DNA 
<213> Homo sapiens 

<400> 665 

ggnngntgnn nntnntaatt nctnttnaat nncantcctt ggntctngnt ntagganccc 60 

atcgattcgc tgaaccctaa aggaaagcca gcaaaccagc tgcttgctct caggactttt 120 

tgcaattgtt ttgttggcca ggcaggacaa aaactcatga tgtcccagag ggaatcactg 180 

acgtcccatg caatagaact gaaatcaggg agcaataaga acattcacat tgctctggct 240 

acattggccc tgaactattc tgtttgtttt cataaagacc ataacattga agggaaagcc 300 

caatgtttgt cactaattag cacaatcttg gaagtagtac aagacctaga agccactttt 360 

agacttcttg tggctcttgg aacacttatc agtgatgatt caaatgctgt acaattagcc 420 

aagtctttan gtgttgattc tcaaataaaa aagtattcct cagtatcaga accagctaaa 480 

gtaagtgaat gctgtagatt tatcctaaat ttgctgtagc agtggggaag agggacggat 54 0 

ntttttaatt gattagtgtt tttttcctca catttgacat gactgataac agataattaa 600 

aaaaagagaa tacngtggat taaagtaaaa attttacatc ttgtaaagtg gtggggaggg 660 

gaaacagaaa taaaattttt gcactgctna aannnaaann actttccagc naanctaaaa 720 

aactnnancc tttaaactat antgagttcg nanaccnggn ccn 753 

<210> 666 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 666 

nnttnnatan nngctcttgt tctttttgca ggatccctcg attcgtctag acctctgaca 60 

tcatggtgtt ttcttaatgc ctcacattgc tggcacgggg atgtgccctg cctgccagca 120 

cctaggactt cgagttgggt tgcagcttat gacatgcatg ataggttttg gaaggtaact 180 

tttaactgca aacctataaa gtactatttt ttattttata aatgaacagg gttttaacgt 240 

gctcaacttt aatttttttc aattgtatga aggccttaaa aaagctacat taagcgtagc 300 

taaaattatt tattggacta aaaactaaca gaacttcatt tccagaattt ttttttttgg 360 

caaatgttta cattcaatta aggggaaaaa gtagaaccag cacaaatgag tggcagttgc 420 

tggagcataa ctgcttcaat aaatcttcat cttggggtaa ttacaggcaa gtcattttca 480 

catcctcttg aggttcagag catcagaatg aactctatga atacatgtgt aagtgccaga 540 

cagctgaatc tttatcaggt attgnaaaga tacacatatg atatgnttat taaaattgaa 600 

ataatgtaaa acacatgaat aaatttgcaa aaccaagatc acagtccacc atatgcactc 660 

tggtacctta aatttttttt ataaataatt naaaagggaa tattggaagc ttcttaaaaa 720 

aaaaaaaaan aaaaaactcg agcctntana acttttgng 759 

<210> 667 
<211> 760 
<212> DNA 
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<213> Homo sapiens 
<400> 667 

ggnnttnaaa ctnctaatnc tgtnttgcag gatcccatct atncntatan angctctagg 60 

cggngcggnt cccactctcg gaaccttgtc ctgtttgtcc cccagctcgg caagcgccat 120 

atgagcctgg cggcgccaga tgcgaatcct gttctgggct ttttggccta ttcccgcccc 180 

tcagtcttgc cgggatggca ccgcccgcat aggacttcca gggttgggct gantgggagt 240 

tcgactgctg ggcctcgtaa ttctcgcttt ggggctgctc cttccaggct gggacacact 300 

ggggcccgct gtcggtctcc cgtcctccga catcttgtct ggaacttccg cctggcagtc 360 

tccagtagga gtggagctct gtgcggcgta ntttggtgga aaaacnggcc ttgcgtcggc 420 

ctcaccccca gtgtttgtgt ttcagaatga agactattct cagcaatcag actgtcgaca 480 

ttccagaaaa tgtcgacatt actctgaagg gacgcacagt tatcgtgaag ggcccagagg 540 

aaccctgcgg agggacttna atcacatcaa tgtagaactc anccttcttg gaaaagaaaa 600 

aaaagaggct tccggtttga cnaaatggtg gggtaacaga aaggaactgg ctacccgttc 660 

cggactattt gtaagtcntg tncagaacat gatcaaaggg tgttacactg ggctttccgt 720 

tacaaagatg aangtctgng natgcttaat ttccatnaan 760 

<210> 668 
<211> 763 
<212> DNA 

<213> Homo sapiens 
<400> 668 

gntctatgtg gctctngttn ttttgcggat cccatttgac gccttggcac gagaagaaaa 60 

cccatggaaa gtagcagtgt tgtgagttgc agagacagga aagatagaag acgttccatg 120 

tgttattctg atggtcgaag tttacatttg gaaaaaaatg gaaatcacac accatcctcc 180 

agtgtgggca gctctgtaga aattagttta gaaaattctg aactgtttaa agatttgtct 240 

gatgccattg agcaaacctt tcagaggaga aatagtgaaa ccaaagtgcg acgtagcacg 300 

aggctacaga aggatttaga aaacgaaggt cttgtatgga tttcacttcc acttccttcc 360 

acttcccaaa aagccaaaag aagaacaata tgtacatttg acagcagtgg atttgaaagt 420 

atgtctccca taaaagaaac tgtgtcctcc agacaaaaac cgcagatggc acctcccgtc 480 

tcagatccag aaaacagcca gggccctgct gctggttctt ccgatgaacc tggtnagagg 540 

aggaagagct tttgtatatc tacacttgca aatactaaag ccactttcca gttnaaaggc 600 

tnccggagaa gatcctctct ttaatgggga aagggagaga gctctcttga ctggccttgg 660 

gaaagggatt ggaacataat ggggagaaaa gaaagccgta attgacattt tctggcanan 720 

tcttgtnanc aagaggggna aagtnaccct tntntgcttg aaa 763 

<210> 669 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 669 

tgnttctaat gctngctctc gttctttctg caggatccca tctattcgaa ttgatgagcc 60 

ttattaacta tcttttcatt atgagacaaa ggttctgatt atgcctactg gttgaaattt 120 

tttaatctag tcaagaagga aaatttgatg aggaaggaag gaatggatat cttcagaagg 180 

gcttcgccta agctggaaca tggatagatt ccattctaac ataaagatct ttaagttcaa 240 

atatagatga gttgactggt agatttggtg gtagttgctt tctcgggata taagaagcaa 300 

aatcaactgc tacaagtaaa gaggggatgg ggaaggtgtt gcacatttaa agagagaaag 360 

tgtgaaaaag cctaattgtg ggaatgcaca ggtttcacca gatcagatga tgtctggtta 420 

ttctgtaaat tatagtttct tatcccagaa attactgcct tcaccatccc taatatcttc 480 
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taattggtat catataatga cccactcttt cttatgttat ccaaacagtt atgtggcatt 54 0 

tagtaatggg aatgtacatg ggaatttccc actgacttac ctttctgtcc ttgggaagct 600 

taaactctga atcttctcat ctgttnaaat gtgnattaaa gtatctacct aactgagtng 660 

tgantgtant gaaagaaagg ncatatntta aacnttgaat ttancaagcc cacnctcgna 720 

ttttatgncc tttcttttgc ctngggattg aanc 754 

<210> 670 
<211> 752 
<212> DNA 

<213> Homo sapiens 
<400> 670 

tgnttctaat anttgctact tgttcttttt gcaggatccc ttttgacgnc tttggcacga 60 

gaaagaaagg gctcgtgaca gagaaagaag aaagagaagt cgttcacgaa gtagacactc 120 

aagccgaaca tcagacagaa gatgcagcag gtctcgggac cacaaaaggt cacgaagtag 180 

agaaagaagg cggagcagaa gtagagatcg acgaagaagc agaagccatg atcgatcaga 240 

aagaaaacac agate tcgaa gtegggateg aagaagatca aaaagccggg atcgaaagtc 300 

atacaagcac aggagcaaaa gtcgggacag agaacaagat agaaaatcca aggagaaaga 360 

aaagagggga tctgatgata aaaaaagtag tgtgaagtcc ggtagtcgag aaaagcagag 420 

tgaagacaca aacactgaat cgaaggaaag tgatactaag aatgaggtca atgggaccag 480 

tgaagacatt aaatctgaag gtgacactca gtccaattaa aactgatctg ataagacctc 540 

agatcagaca gaggactact gttcgaagat ttttggaaga atactgagaa eggcataaag 600 

tgaagatcga catttaaaaa atgaggtgaa agaaagctnt tgtggcatag aaaaagtntt 660 

aagctcaant agttttttta ttattattat tattaaaagt tattcaggac tgatgtgact 720 

ncngatttna gaacatgtgg taatagtnta nt 752 

<210> 671 
<211> 752 
<212> DNA 

<213> Homo sapiens 
<400> 671 

tgnttctaat gttgetaetc gttcttttgc ggatcccatn ttattcgaat teggcacgag 60 

gatattcaca cagtatgtat tatattaacc atatcacact taagttatta aattcagact 12 0 

atttgtaact tattgttata gggcctgccg tatggcttag gatatttgag taatcatata 180 

tttaaagtaa aaactttggg ctgggcacag tggctcacac ctgtaatccc agcacttggg 24 0 

gaagctgagg tgggcagatc agttgaggtc aggagttcta gaccagcctg gtcaacatgg 300 

cgaaacccca tctctactaa aaatacaaaa attagctggg cgtggtggca cacacctgta 360 

atcccagtta cttgggaggc tgaggcacaa gaatcgcttg aaccegggag gcggaggttg 420 

cagttageca agatcgccct gctgcactcc agectgggea acagagggag actctgtctc 480 

caaaaacaaa aacaaaaact gttagtgaag gttccctggg acttttgata ttttaaaaat 540 

tggtcttatg actagtagat aaattcattg ccataatgag gctagctccc agataaacag 600 

tgtattttct tctttttttt ttttggtgag tggtccaaac tttaagctac tttttccagt 660 

antttgecac tttctccgan gtaantttgg ctggtctttn agtaatgeta attgngtgtc 720 

aaaatttgtc tacaacagtt nggcaacaga tn 752 

<210> 672 
<211> 792 
<212> DNA 
<213> Homo sapiens 
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<400> 672 

tgnttctaat actngctact ngttctttct gcaggatccc tctattcgaa ttcggcacga 60 

ggctgcttct ggctgggggg tccttggcct tcatcctgct gagggtgagg aggaggagga 120 

agagccctgg aggagcagga ggaggagcca gtggcgacgg gggattctac gatccgaaag 180 

ctcaggtgtt gggaaatggg gaccccgtct tctggacacc agtagtccct ggtcccatgg 24 0 

aaccagatgg caaggatgag gaggaggagg aggaggaaga gaaggcagag aaaggcctca 3 00 

tgttgcctcc acccccagca ctcgaggatg acatggagtc ccagctggac ggctccctca 360 

tctcacggcg ggcagtttat gtgtgacctg gacacagaca gagacagagc caggcccggn 420 

ccttctgccc ccgacctgac cacgccggcc tagggttcca gactggttgg acttgttcgt 480 

ctggacnaca ctggagtgga acactgnctc ccactttctt gggactttgg agggangtgg 540 

aaccggcaca ctggacttct tccgtctcta nggctgcatg gggagccctg gggagcttna 600 

atnnttgggg gatcccnnaa aangaccccc tgtcccccat anacttgggt ttttngcttt 660 

canccctttc cccttggccc cnnttgacca cttcatggag tttaattaaa atngcccttg 720 

gtangaaaan anaatantnt tcctcntttt antgntnttt tnntataatt tnatnatcct 780 

antnatcntn nt 792 

<210> 673 
<211> 755 
<212> DNA 
<213> Homo sapiens 

<400> 673 

nttctaatnc tngctacttg ttctttntgc aggatccctc gattcgaatt cggcacgagg 60 

cagcttcgag ccaatggtga gctccttctg gatcagctcc ttcagctcct tcttgctcag 120 

gatgctgaaa ttgcaaggct gatggaagac ttggaccgga acaaggacca ggaggtgaac 180 

ttccaggagt atgtcacctt cctgggggcc ttggctttga tctacaatga agccctcaag 240 

ggctgaaaat aaatagggaa gatggagaca ccctctgggg gtcctctctg agtcaaatcc 300 

agtggtgggt aattgtacaa taaatttttt ttggtcaaat ttaaaaaaaa aaaaaaaaaa 360 

ctcgagcctc tagaactata gtgagtcgta ttacgtagat ccagacatga taagatacat 420 

tgatgagttt ggacaaacca caactagaat gcagtgaaaa aaatgcttta tttgtgaaat 480 

ttgtgatgct attgctttat ttgtaaccat tataagctgc aataaacaag ttaacaacaa 540 

caattgcatt cattttatgt ttcaggttca gggggaggtg tgggaagttt tttaattcgc 6 00 

ggccccggnn gccaatgcat tgggccccgg tacccaactt ttgttccctt tantgagggt 660 

taattgcncc ccttggccgt aatcatggta atagctgttt cctggtgnga aattgtttcc 720 

cgtnacaatt ncacacactt ttcancccgg ggacn 755 

<210> 674 
<211> 753 
<212> DNA 
<213> Homo sapiens 

<400> 674 

tgcttctaat gcttgctact cgttctttnt gcaggatccc tcgattcgca gatttttgac 60 

aaggaaggct aattctaaac ctgaaagcat ccttgaaatc atgcttgaat attgctttga 120 

tagctgctat catgacccct ttttaaggca attctaatct ttcataacta catctcaatt 180 

agtggctgga aagtacatgg taaaacaaag taaatttttt tatgttcttt tttttggtca 240 

caggagtaga cagtgaattc aggtttaact tcaccttagt tatggtgctc accaaacgaa 300 

gggtatcagc tatttttttt taaattcaaa aagaatatcc cttttatagt ttgtgccttc 360 

tgtgagcaaa actttttagt acgcgtatat atccctctag taatcacaac attttaggat 420 

ttagggatac ctgcttcctc tttttcttgc aagttttaaa tttccaacct taagtgaatt 480 

tgtggaccaa atttcaaagg aactttttgt gtagtcagtt cttgcacaat gtgtttggta 540 
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<210> 675 
<211> 760 
<212> DNA 
<213> Homo sapiens 



aacaaactca aaatggattc ttaggagcat tttaatgttt attaaataac tgaccatttg 600 

ctgtanaaag atnanaaaac ttaagctttg ttttactaca acttgtacaa agttgtatga 660 

cagggcatat tctttgcttn caanattttg ggttgggggc actangggtt caaaaccctg 720 
gcanaattgt cnactttagn ctgaccataa tnc 



753 



<400> 675 

tgntttctaa acnttgctct cgttntttnt gcaggatccc atctattcga attcggcacg 
aggttccctc accttattcc tccaagttcc cccttgggaa cctctgagat taacttgata 120 
agctccttgg gcaagctctt tatcctaaga ttcctcagtg agccttatag agttgctgcg 



60 



180 



agaattacat ttgttcatga tgtcaagtgt ctggtatgta gctaatgctt attgaacaca 240 

tagtaattta ttgaataatt gtcatgatca ctggatgaga tatagccact gtggaggtag 300 

gcacaccagg gttttagagg cttgggatct tgcaacagga ttttcctctt gcctctccaa 360 

accgcccccc gcccagatgg cttcagcatc tttttgcatc cctgtttcct tgtttggtga 420 

acacctgtct caacctgtct gcaaggcgtg gtgagattct gcatccttgg taagcactca 480 

tgtcactcca aaacagctgt ttgatgctaa tagcacacat gaggtcttgc aaatttgtct 540 

gaggaactac aggacattgg agagatattt atcaaacacc cactacatgc ctgatactta 600 

actanggaac tatnaaagtg ggtggtgaag acaagtgnga agtaaantgc aaacctattt 660 

ccatatatgt ttgnncgcta gattgntncc ancaattngc ntcttggaat tgttgaattn 720 

ggccctgtgt gtgtgcctgt ggtaantgga nntgngtttc 760 

<210> 676 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 676 

ntttgaaact tnctactngt tctttttgcg gatccctcna ttcgaattcg gcacgaggca 60 

gaaccttttc ccctctactc ttgtctaaaa gttctgtgtg gcacacagag atgcgaccta 12 0 

ctcaatctga cttagtaaaa ccatgctgta gaatttttgt cttaaaaaga ccacataccc 180 

agcacccatg aaataaaaga ttcatctgta attgggattc aaagtgatta aattcctttg 240 

ttcatactca taaatagcac taaagtgtta taacattttc atttacctat ttttagttcc 300 

ttcattttaa cttaataaaa atcttggatt gatattcttt tttttttttt ttgggacgga 360 

gtctcgctct gtcacccagg ctggagtaca gtggctctat cttggctcac tgcgagctcc 420 

gcctnccggg ttcacgccat tctcctgcct cggcctgccg agtagctggg actgcaggcg 480 

cccgccacca cacccggcta atttttttgt atttttagta gagacggggt ttcaccctgt 540 

tagccaggat ggtctcgatc tcctgacctc gtgatccacc tgcctnggcc tccaaagtgc 600 

tggaattnca ggcgtgagcc accgcgcccg ggnctaaatt ggatattctt taaccattaa 660 

aaggtttact gggtgnccna tttgccatat tattggaaac ttggaaaggg taatttgaaa 720 

caaagntttg aagttaactg aaatttgggg a 751 

<210> 677 
<211> 756 
<212> DNA 
<213> Homo sapiens 

<400> 677 
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tgctttgaat 
ggataaactc 
aaaagtacca 
gcatgacata 
agagatgtat 
tgtatgaaaa 
gtgaaaatta 
cagattgcca 
aaaaagactt 
agtttagaag 
attttacaat 
gntnttatct 
atttaaaaaa 



cctttgtaan 
ttcagtgacg 
ataatgagta 
aaagccaaag 
gcacatatac 
agaggcaaaa 
tgacattaaa 
gcgcaggttg 
ggaagaagct 
cctgaaactt 
ccattatttt 
ggntcatata 
aaaaaaaaaa 



cgccctntnt 
aatattagaa 
ggaaatcact 
gtcacaaggg 
atacaacagt 
caacaagaag 
aagcaggcag 
gaagccgcat 
gaggaatata 
ttctcgtatg 
tgaccactgc 
tatttctttg 
acccggactg 



gcatgatccc 
ttagttagtt 
tctgcagtat 
aaaaaactga 
catagtaaga 
aaaagattga 
agatcctaca 
atttggatct 
aagaagcacg 
gggtggtttt 
tatgtgttca 
gctaatttaa 
ttttnt 



tcnattcgaa 
atacatttga 
ttttggagca 
tagatttgtc 
caccgttaga 
aaaaatgaga 
agaatccagg 
tcaacggata 
tttagtactg 
tgcattaaat 
agtagtatga 
tatgtcaaat 



ttcggcacga 
ggaaaactat 
ttttccttaa 
tgtgatattg 
caaaaggtga 
gctgaagacg 
atgatgatcc 
ctagaaaatg 
gattcagtga 
nctggggtcc 
gaatgtgatt 
aaatgagttc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
756 



<210> 678 
<211> 756 
<212> DNA 
<213> Homo sapiens 



<400 
gnnnnnnnnn 
aattcggcac 
aagccattcc 
ttgtggcttc 
cccaaaagcc 
atgtgcttag 
gttttaatgg 
atgaactgtc 
agagagctgg 
taagggaata 
atattgaggc 
cctgcattta 
tactatctct 



> 678 
nnnttnnaat 
gaggggtgtt 
aggaggaact 
cagagaggac 
ctgagtttct 
gaagaaaaaa 
acatgggagg 
tatctgggat 
acagactcca 
actagtgcaa 
aagctgtacc 
tctctttggg 
gtancattaa 



agnnagctac 
ggagcagatt 
agatccacca 
tccaaaagaa 
gaaaaaatat 
gtggtaatga 
agccataaaa 
accctatcct 
ttttagcctc 
gctgacttca 
agcagcttct 
atagtttaag 
tttttttact 



ttgttctttt 
gtagttgatc 
cttcctctgc 
agcacaaaaa 
tgaaagtttc 
ttcaaggaaa 
gtagtctatc 
gttttaatct 
ttcacttgca 
agcacattca 

gggggacctg 

cccctgnacc 
ttttgg 



tgcaggatcc 
cacagcaaag 
tgggcatgct 
ctagacagtg 
tatggtgaaa 
cataatcaca 
tatcatcagt 
gagtgactct 
gtccccttat 
ggaatgcact 
ctcantggat 
tggaactgng 



catcgattcg 
agcatcacca 
ccaaaaatgg 
ggagggcata 
taggaagtta 
cacggtttta 
tacatatcta 
ctctcagctg 
ccccctccct 
tactgataag 
ggtcccaacc 
tatttttctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
756 



<210> 679 
<211> 747 
<212> DNA 
<213> Homo sapiens 



<400> 679 

tctaatnctt ggctctcgtt ctttctgctt gatccctcga ttcgaattcg gcacgagaaa 60 

tgactccctg caaaacccaa cccatgctgc tggctgtggg atttttggtg taagcctatc 120 

tatgcactct atcagccaga atttggcatt tagctcttag ttaaatctag taaaggacag 180 

tctattgttt aaagagaagg tgcatttgtt cctcaatcaa gcaagagcac ctgtgttgta 240 

ctgctttata tctcatgtat atttatagta atgaaaagac tttttaaatt gtacacgttt 300 

cagtgccttt cttgtgttat gaaaggcagg tagatattat agccataggt aaaaatccat 360 

agttaaattg cacactgacc ttaaatctct ctgtgtatgc ccttgtatct tgcatgttaa 420 

aagttggatt attgggcatg tgtggcagcc tgccctgcta catgctagac aagtgtgctt 480 

tagtacatag ccacaagttc ttcattcttt aaaatgtttt gacagatcat ctcataataa 540 

aaataattca ngaaaactat ggggaaatag ttacatttca caaaagatat tttaaactct 600 

ttgtaaaact tagataatag agcctancaa gttactttgn atctaattgg atacatttta 660 
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tgnttaattt taccaccata cattttatta atcaaaattg gttagcatgt gactcttttt 720 
ggcttcanaa gttntcaaaa aaattat 



<210> 680 
<211> 750 
<212> DNA 
<2X3> Homo sapiens 



<210> 681 
<211> 748 
<212> DNA 
<213> Homo sapiens 



<400> 681 

ctaatncttg gctctcgttc tttctgctng atccctcgat tcgaattcgg cacgagccca 
gctgctcagg aggctgaggc aggagaattg cttgaaccca agaggcggag gttgtggtga 
gccgagattg cacctttgta ctccagcctg ggcaacgagc aaaaaactct gtctcaaaaa 
aaaaaaaaaa aaagaaaaag aaaaatggct tccaggacag agcatgctca tttgctggcg 
gacagttcca gaaacagacc ctgttagtcc ttctacttac ctgctggatt tttcaagcac 
taaatttata actttttgaa acaaaataat gtgtaatttt ccatttgggg gcaaactcta 
ttcttgtgag cattattaaa atcttgtttg taaatatatt gtctttctct taatatttgc 
tctgggtcan gaagaagctg ttcacggtgt gataatactc tttanattgt gctttcatta 
ttatagatgc atcatgtctt ctgctttcac gtgtctggga tggggtcaga aatgcatnct 
ccagntgaca naaaaatccn agnatgagat caanaaggat actggtgttt tctgactttt 
acaaaaatta ctttgntgtt ttcattaaaa aaaaagcttt aacctantgn ttncntantc 
cttttagaaa ntattaaatt tnaaaatgaa ttcnatanaa atanaannac naaaaaactt 
nntnccttta naactttagt gangcgtn 



747 



<400> 680 

ttctaatnct tggctctcgt tctttctgca ngatcccatc gattcgaatt cggcacgagg 
accggctggg cctacaaaaa gatcgagctg gaggatctca ggtttcctct ggtctgtggg 
gagggcaaaa aggctcgggt gatggccacc attggggtga cccgaggctt gggagaccac 
agccttaagg tctgcagttc caccctgccc atcaagccct ttctctcctg cttccctgag 
gtacgagtgt atgacctgac acaatatgag cactgcccag atgatgtgct agtcctggga 
acagatggcc tgtgggatgt cactactgac tgtgaggtag ctgccactgt ggacagggtg 
ctgtcggcct atgagcctaa tgaccacagc aggtatacaa gctctggccc aagctctggt 
cctgggggcc cggggtaccc cccgagaccg tggctggcgt ntccccaaca acaaa CtC rL 
c tcC ggggac gacatctctg tcttcgtcat ccccctggga nggccaggca gttactccta 
aggggcttga acaccatccc tnccactagc ctctccatac ttactcctct nacagcccaa 

^tgtctccc ttgacccttc tttaatggca acttaactga anaaagggat 660 

atCCaaaatt Wtatttg gcaaataaac canatggatt aaaaaaaata 720 

attntntctt aananaaana actccggcct 75Q 



<210> 682 
<211> 755 
<212> DNA 
<213> Homo sapiens 



<400> 682 

ctaatgctng gctttcgttc tttctgcagg atccctcgat tcgaattcgg cacgagcagg 
agcaatcaat tcctgtcgaa gtgaatacca tgcagctttt aacagtatga tgatggaacg 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
748 



60 
120 
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catgaccaca gatatcaatg cactgaagcg gcagtactct cgaattaaaa agaagcaaca 180 

gcagcaggtt catcaggtgt acatcagggc agacaaaggg ccagtgacca gcattctccc 24 0 

gtctcaggta aacagttctc cagttataaa ccaccttctt ttaggaaaga agatgaaaat 300 

gactaacaga gctgccaaga atgctgtcat ccacatccct ggtcacacag gagggaaaat 360 

atctcctgtc ccctaccgaa gaccttaaga cgaagctcaa ctncccgtgg cgaactnaca 420 

tccgagtcca caaaaagaac atgccaagga ccaagagtca tncgggctgt ggggacaccg 480 

tanggctgat agatgagcag aacgaggcca gcaagaccaa tgggctgggg gcagcagagg 540 

cattcccctt tggntgtcan gcgacagctg ggagagaang caagnaagcc ctgaangcna 600 

gtccaggagg accnncnaag ggcagtttcc ggagcccgnt gttccggaga tgctgatgtg 660 

ggntgtgtct gcanttcang gccaaanttg gggacccctg ggaactgtac cctangggnt 720 

ncttgnagnt taaaacttga ccttaanggn ngcct 755 



<210> 683 
<211> 755 
<212> DNA 
<213> Homo sapiens 



<400 
ggntttnnnt 
tcggcacgag 
ttttctacta 
atctgagatt 
aaacataaaa 
ccatttagaa 
tctcgcatat 
gcagcagtgg 
ccctgggacc 
ggaggaggca 
gactcagggg 
ttggtttact 
acanaattcn 



> 683 
ctttctaatg 
aattagtatc 
tattaagatc 
agaagtttaa 
gtgtttgttt 
atttaatatt 
tatattatag 
gtcagataac 
acctcaacat 
agacctgtgg 
aaagtactaa 
gagatgcttt 
ttaatatttc 



cttggctctc 
aacttacaat 
taatgaattt 
gatcacttga 
ctgttatgtt 
tattaataac 
caggccaaga 
cctgtggcag 
gcgtagcctc 
gtcaattttg 
accaaaaact 
tgtaaaggaa 
ttcctcntgg 



gcctntctgc 
ccaagtccaa 
gatttctttt 
ccccaaacct 
accataattt 
tgaaactgtt 
taaaattttg 
tgacacgggc 
ttgtcttaaa 
aactggnctt 
ctgattntac 
aataatgctt 
gcttn 



ttgatcccat 
gtatcatctt 
ttgaagtttt 
ttgtttatgt 
gatgtatata 
tgtcttcctt 
acagctcttt 
aaattggcat 
tgtactcccc 
actttgattt 
tttgcgtttt 
gngacanttt 



cnattcgaat 
ataatcactt 
ttcttgtaac 
aagaattttt 
gtgtccagat 
tggtatatag 
aagcccacat 
ttgaataaag 
atggcagcat 
taaaacaaga 
ctggaagtnn 
agtaatttct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
755 



<210> 684 
<211> 774 
<212> DNA 
<213> Homo sapiens 



<400> 684 

ggntttnann ctttnnaatn cctttgctnc tcgntctttn tgctggatcc catcgattcg 60 

caagatctgg aggaatgcag agaggaactt gatacagatg aatatgaaga aaccaaaaag 120 

gaaactctgg agcaactaag tgaatttaat gattcactaa agaaaattat gtctggaaat 180 

atgactttgg tagatgaact aagtggaatg cagctggcta ttcaggcagc tatcagccag 240 

gcctttaaaa ccccagaggt catcagattg tttgcaaaga aacaaccagg tcagcttcgg 300 

acaaggttag cagagatgga tagagatctg atggtaggaa agctggaaag agacctgtac 360 

actcaacaga aagtggagat actaacagct cttaggaaac ttggagagaa gctgactgca 420 

gatgatgagg ccttcttgtc agcaaatgca ggtgctatac tcagccagtt tgagaaagtc 480 

tctacagacc ttggctctgg agacaaaatt cttgctctgg caagttttna ggttgaaaaa 540 

acaaaaaaaa tgacatgggt gcagaagctt gtaacattga tcacattctt aatgtaaatg 600 

gtgtctttct tctggggttt cagtatttgc aaagaaantg aagaagaatt ctggaaatgc 660 

cattcaatta accctnagga aaaaagccga ccttanaaat ttaccttant gcnttgnnnn 720 

ttaaaaanaa aaaaaantna aaaaactttn accctttana ccttttgtgg ggnc 774 
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<210> 685 
<211> 759 
<212> DNA 
<213> Homo sapiens 



<400> 685 

ggntttnnan nctttctaat ncttggcttn agttcttttg caggatccca tcgattcgaa 
ttcggcacga gagtacccag agttgcgagg agttttttaa ctgatttagc cnnntggcaa 
tcatgagtga atggatgaag aaaggcccct tagaatggca agattacatt tacaaagagg 
tccgagtgac agccagtgag aagaatgagt ataaaggatg ggttttaact acagacccag 
tctctgccaa tattgtcctt gtgaacttcc ttgaagatgg cagcatgtct gtgaccggaa 
ttatgggaca tgctgtgcag actgttgaaa ctatgaatga aggggaccat agagtgaggg 
agaagctgat gcatttgttc acgtctggag actgcaaagc atacagccca gaggatctgg 
aagagagaaa gaacagccta aagaaatggc ttgagaagaa ccacatcccc atnactgaac 
agggagacgc tccaaggact ctctgtgtgg ctggggtcct gactatagac ccaccatatg 
gtccagaaaa ttgcagcagc tctaatgaga atattctgtc ncgtgttcaa ggatcttatt 
ggaaggacat cttacagctt ccaatgagaa gccaagaagt tgtgaacata ctaattaaaa 
aa« 9aC ccca ttttaatccc tcattaaaan ggttttaaat gttaaaaaaa aaaaaaaaaa 
acttcgagct tttaaactat ngtgagtcga ttcntataa 



<210> 686 
<2X1> 749 
<212> DNA 
<213> Homo sapiens 



<400> 686 

ggnnttnnnn nctttgaaat cccttngctn ctagcncttt ttgcaggatc ccatcgattc 
gaattcggca cgagggcaat tagcctcgct taagttgcct tttttacaca ccaaaacttt 
ttacatgaag ggctggtttc acatgaatac tatactgaaa tctgtgctct caagatctag 
cagtgaccag ggctgcccgg cgggggctct cctggcaagt caggaaggtt tctgttgcta 
atataacata gaaacacatt agtgcactgg gcctctctga ggtcagcata tttgtactct 
tggaatattt gtttttttct tcagtaacaa cagaaacccc agttgggagt ttaacaaata 
actgactacc actcactcat gcatttttat ttccaattaa agcaaagcac tgtgctgtgc 
tcagataata atagtttgta agtaaaagtt tttagttttc agtgttcagg ttatagaata 
taactgacca taaaaattac ctgcaggtat tttcttttta tgaacttgtt tttaaattac 
caagtaatta ctggtgtcat tttgttttat gacagacaca cgtatctaac aaacaaacaa 
acagtgacct tctccatggg tcaaggactt ccttacaatt tctnctgagt taacttttgt 
gaaaataatc ctaaggtttt ctggcttatt gaggaaattn ctacaaacaa caaaccaaca 
acngaagaga agatcatcaa ccactgttt 



<210> 687 
<211> 760 
<212> DNA 
<213> Homo sapiens 



<400> 687 

ggnnttctaa tgctttctaa taccttggct ctngctcttt ctgcaggatc ccatcgattc 
gaattcggca cgaggaaatg tgtatttcag tgacaatttc gtggtctttt tagaggtata 
ttccaaaatt tccttgtatt tttaggttat gcaactaata aaaactacct tacattaatt 
aattacagtt ttctacacat ggtaatacag gatatgctac tgatttagga agtttttaag 
ttcatggtat tctcttgatt ccaacaaagt ttgattttct cttgtattac attttttatt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
759 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
74 9 



60 
120 
180 
240 
300 
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tttcaaattg gatgataatt tcttggaaac attttttatg ttttagtaaa cagtattttt 360 

ttgttgtttc aaactgaagt ttactgagag atccatcaaa ttgaacaatc tgttgtaatt 420 

taaaattttg gccacttttt tcagatttta catcattctt gctgaacttc aacttgaaat 480 

tgtntttttt tttctttttg gatgtgaagg tgaacattcc tgatttttng tctgatgtga 540 

aaaagccttg gtattttaca ttttgaaaat tcaaanaagc ttaatataaa agtttgcatt 600 

ctactcanga aaaagcatct tcttggatat gtcttaaaat gtatttctgt cctctataca 660 

naaaagttct taaattgatt tttacagtct ggaatgcttg gatgntttaa aatantaaca 720 

ttttatattt tttaaaagac aaancttata ttnatcctng 760 

<210> 688 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 688 

tgntttctaa tgcttctaat agcttggctc tngttctttc tgcaggatcc catcgattcg 60 

aattcggcac gagacaaaac ctacagatgg agataaaaat tactactgtt attcaacatg 120 

tgttccagaa ccttattttg gggagtaaag tcaattgggc agaggatcct gcccttaagg 180 

aaattgttct gcagcttgag aagaatgttg acatgatgta ataagaattc atttctgaca 24 0 

tattttacat ttctggcaat ctcaactctt atttggaata cttctgtgca tttgtctgtc 300 

caccgtaatt ttagaaaagc atatccataa cgtttacagt tgtagtacag ttgtggttag 360 

ttatttgtag tgggattgaa agtaattttt ttctttttat atttctatat ttagtttgtt 420 

tttttgttgt tgttgttttt tgagatggag tctcgctttg ttgcccagac tggagggcag 480 

tggcgcgatc tcggctcact gcaacctctg cctcccgggt tcaagcagtt ctgcctcagc 54 0 

ctnccaagta gctgtgacta aaggtgcacg ccgccatgcc canctaattt tttggatttt 600 

aagtagaaac cgggtttcac ccgtgttgcc caagctgctc tnaaaactcc tgagctcaag 660 

cagtccaccc gncttngcta ccggantgct aggattcaga cgtaagcccc cgaancctgg 720 

ctagtttgcn ttnttttctn tcattttata ag 752 

<210> 689 
<211> 806 
<212> DNA 
<213> Homo sapiens 

<400> 689 

gtgntttcta atgcttctaa tngcttggct actcgttctt tntgcaggat cccatcgatt 60 

cgaattcggc acgaggannt ctntgctatn gaacagnggc tggtnnacac tnnggantta 120 

nnnntgnacn ntannnattg nancanntan tactggnnnt ccntaatncn nttaatgtna 180 

cntnttgcaa gnngnnctga tnaaatacac gacaggaggg aaanctantg cgtcataggc 240 

acaggcagac ctaccgnnta aggagatnat ntnccnnang gntggctgtt gagnncatgc 300 

aactctggna tgtatttccc tttataggac caccttgtnc atngtggata aagcccctaa 360 

agnaggatgn naaagatgat cngatccaat acgttacnct gacannaaan nntgtnatac 420 

ntcngctgan caatctntcc ancnnntnta atatcgtgna tcacctaggg tgtatgatcn 480 

taggaactct gcncctncan tcnggactgt ccatcacnga ctnntgggct nctactgtac 540 

antangcgna gaanancnnt cannctacan ntaaccagat tggtgctgnn anatggtant 600 

gcnntttnan cncccacgac ncaataaagn ncnnctntnc cccanancct ntnnagggaa 660 

gaaaggaatt ttncatagtg ggctcaatga anggggtacc cttggncttt ntaaaaaacg 720 

ttncatggnn cctaccttaa acctgngtna actnanancn nttngncata angggtctaa 780 

cgnctatang gggnacnnat ttttnc 806 

<210> 690 



210 



WO 99/33982 PCTAJS98/27610 



<211> 772 
<212> DNA 
<213> Homo sapiens 



<400> 690 

ntntttgaat ctttgaaata ectttgctat agttctttat gcaggatccc atcgattcga 
t"!^ aC f a9a " ttgCt ca ^tgaagg agcacaggag ggttttccag gccatgtggc 
tcagcttcct caagcacaag ctgcccctca gcctctacaa gaaggtgctg ctgattgtgc 
atgacgccat cctgccgcag ctggcgcagc ccacgctcat gatcgacttc ctcacccgcg 
cctgcgacct cgggggggcc ctcagcctct tggccttgaa cgggctgttc atcttgattc 
acaaacacaa cctggagtac cctgacttct accggaagot ctacggcctc ttggacccct 
CgtCaagtaC c 9 c 9<~cgct tcttccacct ggctgacctc ttcctgtcct 
cctcccactn cccgcctacc tggtggccgc cttogccaag cggctggccc gcctggccct 
gacggctccc cctgaggccc tgctcatggt cctgcctttc atctgtaacc tgctgcgccg 
gcaccctgcc tgccgggtcc ttgtgoaccg tccacacggg cctgagtttg gacgccgacc 
cctacgaccc tggagaggag gacccagccc aagaccoggg cctttggaaa acttccctgt 

gggaagcttt aagnnccttc nanangccac ttacccaacc r.f. as , rmrmT .*. 

gccanccggt nattaaccaa ggccctggnc aatgcctgaa ggtcaaacaa tn 



<210> 691 
<211> 755 
<212> DNA 
<213> Homo sapiens 



<400> 691 

ntgctttcna atctttntaa atgcctttgg cttctogntc tttctgcagg atcccatcga 
ttcgaattcg gcacgagaaa aagtaaagct tttoatgagc acaaatncct tgcattgttt 
gatgttactg atattcgtaa aatgaatatt ttttgttttg ttttgtttta tttttttgag 
acaagtcttg ctttgttgcc caggctggag tgcaatggca tgatcttggc tcactgcaac 
ccctgccttg cgagttcaag tgattcttct gcctcagcct cctgagtagc tgggattaca 
ggcgctcacc accacaccca gctaatttct gtatttttag tagacacagg gttttaccat 
gttggccagg ctggtctcaa actoctgacc tcaaactcct cacacctgta atctcagcac 
tttgggaggc tgaggtggaa ggatcacttg aagccagagt ttgagaccag cctgtgcaac 
catlap CC f gt f Cta caaaaactta aaaaattagc tggctgtggt gttgctcacc 
catagttcca gctactcggg aagctgagca ntaagatcac ttgagcccan gaggccnatg 
«ST?? a a f gtgattg ^ccantac agnccacctg ggtgacanag taaanaaaan 
gaaacattac ataatttggc tagagcataa taaattgatt tctgggttnt gaaattnnag 
ttgccataaa aggnntttna atgngcnant tcant 



<210> 692 
<211> 748 
<212> DNA 
<213> Homo sapiens 



<400> 692 

tgnttttaat cnttctaatn cttggctctt gttctttttg caggatccct cgattcgaat 
tcggcacgag gtccgaagaa aaagactgtg gtggcggaga tgctctctcc aatggcatca 
agaaacacag aacaagtttg ccttctccta tgttttccag aaatgacttc agtatctgga 
gcatcctcag aaaatgtatt ggaatggaac tatccaagat cacgatgcca gttatattta 
atgagcctct gagcttccta cagcgcctaa ctgaatacat ggagcatact tacctcatcc 
acaaggccag ttcactctct gatcctgtgg aaaggatgca gtgtgtagct gcgtttgctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
772 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
755 



60 
120 
180 
240 
300 
360 
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tatctgctgt 
agacttatga 
atcacccacc 
gctctatcta 
catcaccttg 
gtgcataata 
accncagact 



tgcttctcag 
attagtgcga 
aatcagtgca 
tcccaaactg 
gagctncttg 
tcattgnggg 
ggggacaaat 



tgggaacgga 
gatgaccttg 
tttcatgctg 
aaattctggg 
aacacaatga 
taaactgtgg 
ntgtgttg 



ctggaaaacc 
gatttagact 
aaggattaaa 
ggaagagtgt 
ggcatataca 
atcgaacagt 



tttcaaccca 
catctccgaa 
caatgacttc 
agaacagaac 
tggacaaatc 
ntggcaatgt 



ctgctgggag 
caggtcagcc 
atctttcatg 
ccaaaggaac 
cacctgctgt 
ggaaattnta 



420 
480 
540 
600 
660 
720 
748 



<210> 693 
<2il> 881 
<212> DNA 
<213> Homo sapiens 



<400 
tgnnnngtna 
aattcggcac 
cgtgctgacc 
ccgtctggca 
ctgccttcgc 
gcacgttttn 
tggtggcgta 
ctttcaggac 
cngngaggcc 
atgcctnagg 
ccattcttga 
ntatctantt 
tagnaattca 
taaanattnt 
atnncannnc 



> 693 
accagggaaa 
gaggcggtga 
atctggcagt 
gacacaaaac 
caattaccgc 
gtgcntgcac 
tgggcttgtg 
aggtcaanca 
naangaggan 
gacatatcan 
aataagantc 
antaaaacca 
aangntgtgt 
tatgcantrxc 
nataatntag 



agctnngttt 
cccacgtgtc 
gntcttcgta 
aatgctctct 
ctgtgggagg 
gtcaagctct 
gantgcgtgg 
agtcagagga 
naantnataa 
ccggggttct 
nttgnttnaa 
caaatntgtt 
nnttacnaat 
ncacaatnta 
nanatntntn 



gaactccttg 
cttttgattg 
ttctctggcc 
acggcgttct 
ccctgggctt 
acattctgct 
agtcccaaga 
tgaagaanat 
aagcaccagc 
cattacccat 
ttntcaactt 
nncnatnacc 
ntntaanggg 
tataanangg 
nnccntgtan 



ggcatgatcc 
ccctactgct 
tgtggggcgt 
gtttganaag 
cgtcattgcc 

gggggtccng 

accccgaatc 
tcanacaaan 
cagaagaatt 
cttaancncc 
ctttttatgg 
accanttctt 
ttattcaaan 
tcctnaaaac 
n 



catcgattcg 
gtggagacct 
ggcaagatgc 
agcaaggaag 
ttcnggtaca 
agcctgacca 
anacc enact 
atgtgatanc 
ttcttanaan 
anatttngnc 
tnatttcnat 
ttaaaccatn 
ttcnaaattt 
gngnnccaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
881 



<210> 694 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 694 

atngcttggc tctngttctt tetgeaggat cccatcgatt cgaaaattta tagtaatgac 60 

aaatgactta tcagtgttca tcatctgaaa gctaagtggt tegttcaate actttttcaa 120 

agttgatagt agattgeatg gtttcatgtt tcctcatatt ggtttattaa ttctatttaa 180 

tcaaggaaaa taacttcaga ttccataaag tttcagttta tttttagttt actactaggt 240 

gagatagcac attacatact tttactatca aatattattt tagcagcttc ccatagtacc 300 

aaatgatttg attccctact ctcatttttt aaagcatata aatatttatg ggcttaaaaa 360 

gggggttttt aaaaactgag gatatcanta ataaattgea gaatattttg caaagctttc 420 

ttttggaaag caaacttttg tgcctgccta tatgenaagt attttatcag ggacttgaac 480 

aaagacctca ctctttttca cttgtcttat gtcgagagaa aaggttattg gcagncacat 540 

tcctaanact ggggaatggt gtgtnctttt naaatttgaa gataactttt agggtaatta 600 

tggaaactcc tcaaangagg ganaaagtna tttttttcca gacatttttc ctcaattctg 660 

ggtctttcac acactanntt tccatagtnc nagaatttct gnntttttac catttgggct 720 

gtgaaatgtt cacaatntcn ng 742 



<210> 695 
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<211> 745 
<212> DNA 

<213> Homo sapiens 



<400> 697 

ctaatagctt ggctactcgt tctttntgca ggatcccatc gattcgcagc cctcttccct 
cccctgtcaa gtcacttacc atgcaaacca caggctctaa gagtttgtcc ccagggacat 
caatef^ gctcctgggt cccctggtga gcatggagtc aggaggtcat 

caatcatcat gctggggttg gtgcgagagg ggccacagac ctgaaaccaa atggatctga 
ctggggcagc tgcccctcag tgtcagaggg gctcgacccc tccggtctct aaggaagtcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
745 



<400> 695 

tttcaaatng cttggctact tgttcttttt gcagggatcc catcgattcg aattcggcac 
gaggctagac gaagtggtga agcccaaaga cttatttttg agctcgctgt aagactgaga 
aatcacgtag tccttcctga aacoactaag aggaaaaatg tctgtgacac tgcatacaga 
tgtaggtgat attaaaattg aagtcttctg tgagaggaca cccaaaacat gtgagatgga 
gtctcgctgt gtcccccagg ctggagtaca atggcgcgat ctcggctcac tgcaacctcc 
gcctcctggg ttcaagcaag tcttctgcct cagcctcccg agaaotggaa gaggaggcaa 
cagtatttgg ggcaagaagt ttgaggatga atacagtgaa tatcttaagc acaatgttag 
aggtgttgta tctatggcta ataatggccc gaacaccaat ggatctcagt tcttcatcac 
ctatggcaaa cagccacatt tggacatgaa atacaccgta tttggaaagg taatagatgg 
tctggaaact ctagatgagt tggagaaagt tgccagtaaa tgagaaagac ataccgacct 
cttaatgatg tacacattaa gggccntaac tattcatgcc aacccatttg ctcagtagct 
attgatngan ctggacaaat tactttgncc aaattgctng aacacacttt attacraoaot- 
taccccgntt ccaattatgt canaa 

<210> 696 
<211> 795 
<212> DNA 
<213> Homo sapiens 

<400> 696 

tttcaaatng cttggctant ngttottttt gcaggatcoc atcgattcga attcggcacg 60 

aggctggcca aagccaaatc tcctaagtcc accgcccagg agggaaccct gaagcctgaa 120 

ggagttacgg aggccaaaca tccagotgca gttcgcctcc aagaaggggt ccatggccct 180 

agtcgagtcc atgtgggctc tggggaccat gactattgtg tocggagcag gaccccccca 240 

~™*o^ 9C Ct9 ^ CCtagt °*ttccagag gtgggctccc gatggaatgt caagcgccat 300 

caggacatca ccatcaaacc tgtcttgtcc ttgggcccag ctgcccctcc gcccccatgc 360 

atanctgcct cccgggagcc gcttgatcac aggactagca gtgagcaggc agatccctca 420 

gcaccctgcc ttgccccatc cagcttgctg tcccctgagg cctnaccctg ccggaatgac 480 

atnaacacta ggactncccc tgaaccctca gccaagcanc ggtcaatgcg ctgttacccg 540 

aaaaagcctg caggtcaagc cagcccctta agocagggct tggcangggc ccgccnaagg 600 

ccgnaacaag accgntctgt naactcttgg gttccaaacc cggaactttg cccgaaagca 660 

tttntttccc ttaattcctt oaattcaatc cggnctttcc ttaatttccn ggattcttng 720 

Talllllll ttnnr"" tCCCCCCCa * — aagaa aaggttgggc ccgaaangg^ 78 0 

795 

<210> 697 
<211> 734 
<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 
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caaagagaat 
gctgggtggt 
ccttgatgtc 
tgtgatatgg 
gaattcccac 
ttttcccgtg 
gttttccaat 
ggtctctaac 



gctctgtggg 
ccgagcgact 
ttcctcatta 
tntagctgtg 
atgttgtggg 
ctattctcgt 
ttttggttct 
antn 



tccctagcat ctgaggagga cgggctcctt cagaactcgg 
catgatttgc atgggactct ggcaatctgt agccccaatg 
acactgtcac gtctcaccag gaatacagtg acattaaaag 
cccccaccca catttcaact tgaactgtat ctatctccca 
anggacccag ggggaggtaa ctgaatcatg gnggctggtc 
gaatngtgaa ntttnacgag atctgatggg tttatcaggg 
tatttttctn ttgcaatctg catttaagna antgccnttn 



360 
420 
480 
540 
600 
660 
720 
734 



<210> 698 
<211> 728 
<212> DNA 
<213> Homo sapiens 

<400> 698 

ttcnaatngc tnggcttttn gttctctttg caggatccca tcgattcgaa ttcggcacga 60 

ggtttaattt aaacctctca tcttttttta agcactcact gantttgacc gagacagcca 120 

gtcgccgttg aggaatcctc tgttgtcaac atcgagaccc ctggttttcg ggaaacccaa 180 

tggtgatgca gttgattatc agaaacagct gaagcagatg attaaggatt tagccaaaga 24 0 

aaaagataaa actgagaaag aattgcccaa aatgagccag agagaattta tccagttctg 300 

taaaactctg tacagtatgt tccatgaaga tccagaagaa aatgatttgt atcaagccat 360 

cgccacagtc accacactgc tgctgcagat cggggaggtg gggcagcgag gcagcagctc 420 

tggaagctgc tcccaggagt gtggggagga gctgcgggct tcagctcctt ctcctgagga 480 

ctcggttttt gcagacactg ggaagacgcc ccaggactcc caggcatttc cagaggcggc 54 0 

agaaggggac tggactgtct cccttgaaca tattttagct tcacttctga ctgaacagtc 600 

attagtcaac tttttttgaa aagccactgg acatgaaatc caaacttgaa aatgccaaga 660 

tcaatcagtn caatctcaaa cttttgaaat gaccncaatc caatctggac ntaagctgag 72 0 

tacttgtn 728 

<210> 699 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 699 

tttcaaatcn cttggctntt ngttcttttt gcaggatccc atcgattcga attcggcacg 60 

agggaaaaac aacaggtttg agtcctataa agccataatt taactccagt agctgatgtc 120 

agacaagctt gtcctatgtc ctatttgagt ggcagcagcg ccagcccagc aagaaggctg 180 

ggggttgtca aggttgtccc cagaccttgc ttgcagtggt tggagaaccc agggggctgc 24 0 

cttgggccct ctggccagag ggaagcgggc agctctagcc ctggagattg tggtcacatt 300 

ggggcttgtt taggattgga gggccaggtc acctccccag ccaccctccc ttctctcctc 360 

tggggtcccc actttagggc gactttgccc gagcccacgc atccatccac tcctttagtg 420 

ccttgaatct cattcacaag cagccccctc ccttcccctc cccttctcac tctgttgatg 480 

taatcctncc acccccagtg tccatcctaa gacaggcatc aaaaagaggc cctaacttta 540 

cttnccaaat ggtgcttttt aaaaaacacc atcactacat tangggcaat tttttcacac 600 

cttcctgtct tcagaatgta aaagggtggg ggaattattg tctctggtta aatntgcacn 660 

cccttgactt gtgggggttt tggggcatgt tcanntattt angaatgaat tncaattnga 720 

caaaaggggg tttantnaat tgttnt 746 



<210> 700 
<211> 759 



214 
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<212> DNA 

<213> Homo sapiens 

<400> 700 

gntttgaaat ccctttgctt tnaaatcctt tgctanttgn tctttttgca ggatcccatc 60 

gattcgaatt cggcacgaga taagggtggg gccttaattc agtagaattg gtggcctcct 120 

aagcagagga agagagattt ttctttctct ctctgccatg tgaagacagt gaggagtcgg 180 

ccgtctgcaa gccaagaaga gcccttatca ggaacagact tggctagcac cttcatcgtg 24 0 

gacctccagc ctccagaatt gcaagaaaat acatttccgt cgttgaaacc acccagtctg 300 

tggtattttg ttatggcagc ccaggcagac taatacgtga agcctgctct aaatagataa 36 0 

aataagaaat tactacagag ggctctttag aaattgtatt taaaaacaag acaatccata 420 

tttacctaag atttacagaa tgtatgtcta taaaaggagg gatttctgga ctagatgatg 480 

atgaaaaatg ttcatataaa ggcaccttca gcttcgagtt gccaacacag gaggaagaat 540 

gctccctgct gttcagatgc tgatatgtgt cctgtgcttt ctggatggcc agtgggatca 600 

taagctggta gaagccagaa ctttcatcca ctgacttcat attcttncac atnctggaac 660 

tgtgggtgtt tgacctttta aaaaataaat ttaagcaaat tgaaatgntt tcctttgaga 72 0 

nttttggcca naaacccaca tnganatttt ncgtctncc 759 

<210> 701 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 701 

gcttnnaatt ccnttccaaa gnaaaccctt tgnaaattnc cctttctgnt tggatcccat 60 

ccgattcgaa ttcggcacga gggtaagtca ggtgattgaa tcccggaant nttcattgtc 120 

ttcaagctca caatactatt ttgggacaaa cagttgtcta gtgtttggac tcatgaaccc 180 

tgattcttga gggtggtatt ttactgcttt tgtgatttgg tttcaacata tatagtcttt 240 

tctccggagt taccttaggt cagtggccag tgtttcagcc cctggaaagg gcatgggctg 300 

ccactgaggt tggtcacagg cctctcagct catggtggga gtgggttcag gagttggtaa 360 

gtagggttca gttctgttgt tgccaccgat ggcaacaggg gtttgtaata atccctagtt 420 

gtgtcaatta tgtcacttaa ttttcacaac aggtctctga agtgtttctc atctcatttt 480 

tacagatgag gcctgcctgt gttaatacac ctagtgagga gtggagctga atttgaatgc 540 

aagccttggc accttaattg agcaagtttg aaacctcgct tgttgccctt ctggaaggag 600 

tcangaattt ncagttctgg gcctgggctg tgggtctggc agacagacct ctggccctaa 660 

ggtttgggtn ccangttctc tgcttccaga atgagaagct ttgctgtgca ccaagnanct 720 

gggcccctct ggnatctcnt gaatnaaaan n 751 

<210> 702 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<400> 702 

gntttgaanc ccctttnntt naaatccttt gctacttgnt ctttttgcag gatcccatcg 60 

attcgaattc ggcacgagcc tgaatataaa gaggaggagg aagaccaaga catacaggga 120 

gaaatcagtc atcctgatgg aaaggtggaa aaggtttata agaatgggtg ccgtgttata 180 

ctgtttccca atggaactcg aaaggaagtg agtgcagatg ggaagaccat cactgtcact 240 

ttctttaatg gtgacgtgaa gcaggtcatg ccagaccaaa gagtgatcta ctactatgca 300 

gctgcccaga ccactcacac gacatacccg gagggactgg aagtcttaca tttctcaagt 360 

ggacaaatag aaaaacatta cccagatgga agaaaagaaa tcacgtttcc tgaccagact 420 
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gttaaaaact tatttcctga 
agagtacaac gtgatggcaa 
actgcccagt tcaagagacg 
gtcatcaaga aacgaagtac 
tgctaatgga cccgaactgt 
ttatgttaaa aaatgtccat 



tggacaagaa 
caaactcata 
ggaatcccag 
agatccngtc 
gacgatcctc 
ttactgng 



gaaagcattt tcccagatgg tacaattgtc 
gagtttaata atggccaaag agaactacat 
atggcactgt taaaaccgta tatgcaaacg 
ggataagagt taanggcaag gagggtaatg 
atgtgatcat gaagtaccag tactgacttt 



480 
540 
600 
660 
720 
748 



<210> 703 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<400> 703 

ggnnntnnna gnntttgaan tccctttnnt tctaatncta ggcttctngt tctttttgca 60 

ggatcccatc gattcgctca gctgaggcaa ttaaactgga aaagaaatag attgaaaaga 120 

tactacagaa gaagcagtac agaagttggg ggactgaagg agagggagcc actgcaggtg 180 

ctagctgctt aaggggatac cagtcctttt acagatataa tagatacagc ttctgaggtg 240 

gagggtgata ggagtgtgta gagaaattgc agttcagaac tggagcatgc agttaggcaa 300 

gaggcatccc atgtgaagat gtcaagcaag tactggaaaa tgctgaacta aaactcaggg 360 

atggatatgt agatttagag aacttcattg tagaggcagt cattgaaagc taaaagggct 42 0 

gataataaaa ttgccaagga tggaaatagt aagagggagt cagtgttatt aggattagaa 480 

ttctgttttg ttttttcttt aaacagattc tcgctctgtc accctggctg gagtgaagtg 540 

gtgtgatctc ggctcactgc ggcctcgacc tcccaggctc aagttatcct cccaactctc 600 

agccttccaa gtagctggga ccacagccat tcaaacacat gcctgcctta tgtttggatt 660 

tttttgtana aaccaaggtt ttgccatgtt tnccaggctg gnctnngaac ttctgggctt 720 

aagccattcc cccacccttg ggtctcccaa aatgctngcc attatangg 769 

<210> 704 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 704 

cnaannncnn ggnttcnaat annaggctac ttgttctttt tgcaggatcc catcgattcg 60 

aattcggcac gagacccgtc cggggccggc caatttgcat atttggaatg cgccgctata 12 0 

aacccggctg gggttttgca gcgatttctt agatgtaaaa atgagatctc aatagcagcg 18 0 

ggctgggcac atcctctcct ctctccttct ctctctgccc ggagctggtt tccgtctctc 240 

ggctcggggc tggaactccg gcccaaccta ggcgcgcagc cgccacgaga tggcgcactt 300 

ccgatcaatg tcaaagccgc cggggagccg ggaaccccag catgattctt ggcctttgtt 360 

cgcttctgat actaagagca gcacggtaca ttatttcact tgtcccgctc cccttcataa 420 

cagaaaaagg ggactcaccc tcaagaagtg attggtatgg taatttaaag caacgcgcat 480 

tcgctaggcc tcgcgagcgt cgccgcgcgg agaagccagc tgtcccttgg cagtgatttc 540 

ggaaatgtgt caaggcaatt ccaaaggtga aaacgcagcc aactggctca cggcaaaaga 600 

gtggtcngaa aaaagcgctt gccccttaca cgaagcacca gacactggag ctggaagaan 660 

ggagtttctg ttcaatatgt acccttactc gaaaagcggn gcctagagaa taacccgcan 720 

cgttccacct taacggacag gacaagtgga aaaatcttg 759 

<210> 705 
<211> 777 
<212> DNA 
<213> Homo sapiens 
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<400> 705 

tttgaaatcc cntttnttna aatcctttgc tncttgttct ttttgcagga tcccatcgat 60 

tcgtcctgaa gctcgggggg ctgcaggtcc tgaggaccct ggtgcaggag aagggcacgg 120 

aggtgctcgc cgtgcgcgtg gtcacactgc tctacgacct ggtcacggag aagatgttcg 180 

ccgaggagga ggctgagctg acccaggaga tgtccccaga gaagctgcag cagtatcgcc 240 

aggtacacct cctgccaggc ctgtgggaac agggctggtg cgagatcacg gcccacctcc 300 

tggcgctgcc cgagcatgat gcccgtgaga aggtgctgca gacactgggc gtcctcctga 360 

ccacctgccg ggaccgctac cgtcaggacc cccagctcgg caggacactg gccagcctgc 420 

aggctgagta ccaggtgctg gccagcctgg agctgcagga tggtgaggac gagggctact 4 80 

tccaggagct actgggctct gtcaacagct tgctgaagga gctgagatga ggccccacac 54 0 

cangactgga ctgggatgcc cgctagtgaa gcttgaaggg tgccaaccgt gggttgggct 600 

ttcttaagca tggaggacat ttttggcaat gcttggcttt gggccattta aatgggaaac 660 

cttgaaaggc caaaaaaaaa aaaaaantna tntnaaaaan aaacttnnac cttttaaaac 720 

ttttaantgn ngnccgnttt tacnttanat tccagacttg attaggaatc cattttt 777 

<210> 706 
<211> 760 
12 > DNA 
<213> Homo sapiens 

<400> 706 

gntttgaaat nccnttnntt caaatnctng gctacttgtt ctttttgcag gatcccatcg 60 

attcgaattc ggcacgagna atgcaaaggg ctgcagttct cattcaggct actttcagga 120 

tgcacagaac atatattaca tttcagactt ggaaacatgc ttcaattcta attcagcaac 180 

attatcgaac atatagagct gcaaaattgc aaagagaaaa ttatatcaga caatggcatt 240 

ctgctgtggt tattcaggct gcatataaag gaatgaaagc aagacaactt ttaagggaaa 300 

aacacaaagc ttctattgta atacaaggca cctacagaat gtataggcag tattgtttct 360 

accaaaagct tcagtgggct acaaaaatca tacaagaaaa atatagagca aataaaaaga 420 

aacagaaagt atttcaacac aatgaactta agaaagagac ttgtgttcag gcaggttttc 480 

aggacatgaa cataaaaaaa cagattcagg aacagcacca ggctgccatt attattcaga 540 

agcattgtaa agcctttaaa ataaggaagc attatctcca cattagagca acagtagttt 600 

ctattcaaag aagatacaga aaactaactg cagtgcgtcc ccaacaagtt atttgtatac 660 

agtcttatta cagangcttt aaagttccaa aaggatattc aaaaatatgc caccgggctt 72 0 

gccacactta attcagncat tctatcnaat gccccagggc 760 

<210> 707 
<211> 856 
<212> DNA 
<213> Homo sapiens 

<400> 707 

gttgctttga agcctttgaa atncnttgtt tnaaatnctt ggctttngnt ctntttgcag 60 

gatcccatcg attcgcctcc ctggatgtgc agacatggag gaggacagaa ggcccagctc 120 

agtggccccc gctccccacc ccccacgccc gaacagcagg ggcagagcag tctggaggtg 180 

gtgntcccac ttgatgaaga gcaggcgact ggnttggaga gggagatcat gctggctgna 240 

aagaanggac tggacccata caatgtactg gccncaaagg gancttcagg caccagngaa 300 

gacccaaant tantncccta catntccaac aagagaatag naagctgcat ntgtgaanag 360 

gacaatacca gcntcnantg gttttggctn nacaaangcc angnccancn atgcccccnn 420 

tttgnaaccc attacaanct gntgccccan tagctggcac actgancncc tnntctaaat 4 80 

tacttaaaat natgctgtan aagtatantn tttncagaan agactaanca ntncatngnc 540 

tacttctcca aaaaaaantg anaaaaatna taaaantcaa antaaatact aaatnannan 600 
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ataanancan tannaantta tatttcnnan atantanann nancnnttta naannantta 660 

nggnnancan nnattantnn tnnatanntt acattaaant tattnanann anaaannnan 720 

nananannat atattannan anantnacnt aaactnnnnt naatnntcca nanacttnaa 780 

naanaataag nnntanatna nnnnttangn ntnatatann ttnanatann nnnnacnata 840 

nnacatnnnn tannga 856 

<210> 708 
<211> 766 
<212> DNA 
<213> Homo sapiens 

<400> 708 

ctaatactgg ctacttgttc tttcnaagcn ctggnntttn annnatnnag ctacttgttc 60 

tttttgcagg acccatcgat tcgcccaaac ttatcggggg tgccagaggc agagtagaca 120 

agccttagtg gccgccattt gttgaatatc tactgtgcgc caagcagtgc gtcacaactt 180 

tatgaagtag gtattattat catccccatt ttacaggtga agaaactgag tctctgagag 24 0 

accaactttt ccaaggtcac acagaggtgg gatccagccc acttccgtct gaccccaagc 300 

ccctgctgtt aacccctgcc ccattgtggg gaggttccgg cccactctgg agttctctgg 360 

tctgcgtcag tcctcaggag aagaaagaat gggggtgatg ctccaaatat tgaggctccc 420 

atctgtctgt cctgcactag gcagagccag gcttctccat ggggcacagg agagagggca 480 

ccagatctga ggagcaaata ggttcttggt ctgagatctc atgggatcag gttgccagcc 54 0 

ctgcaaaccc ccgctcangt ctagaggaca tggagctgcc tttcaaggtg catttgcttc 600 

ctttacagac tcggactctg tnctctggct actttgggcc gtcccggact cgggaatgcg 660 

tnctacactt gtaggggcaa aaccccggtt tgactctttc cgggttccta cccttaacca 720 

agcctttact ttctngggat caccctgttg ggactttttg tccacc 766 

<210> 709 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 709 

gaannccntt nnnttgcaaa tnntnggcta cttgttcttt ttgcaggatc ccatcgattc 60 

gaattcggca cgaggttttt tttttttttt tttggagaat gaatgcaaga tttattgagt 120 

ggtggaagta gctctcagca gatggctggg gagccagaag ggggatagca tgggaaggta 180 

gtcttcctct ggagtctggc tgctcagcag ccgggatctc ctactgtcct tggccgaatt 24 0 

tcccttggcg tccgaatcgt tccaccatca atggcctgcc agcgtctttc gatgtgttct 300 

tctgccagtg tgttcctctt gacgtccagc cgcttgtgtg tgtgcccgct ggggtctcag 360 

ggtttttata ggcacagaat gggtggcatg gcaggccaga gtggtcttgg aaaatgcaac 420 

atttgggcaa gaagacagga gtccttgttc tcattaggtc catgggcaca agcctgaggg 480 

tggagccctt gccagtgacc ctgcccttct ctacccagca cttccctgtc cccctcccat 540 

atcaccgttg ccatcttgtc cttgatgagg aatacaactc ccaattcagt gnttgcttgt 600 

gggaagatgc aatcctcttt atgacaagtt tctaanaagt tgataagaaa aatggggacc 660 

tgcctaaggg ctagtatctc atttaatact ctatagaata ttatgnggtt ttccctttta 720 

ngttttaaat gttgaananc nan 743 

<210> 710 
<211> 753 
<212> DNA 
<213> Homo sapiens 
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<210> 711 
<211> 718 
<2i2> JJJNJA 
<213> Homo sapiens 



<210> 712 
<211> 783 
<212> DNA 
<213> Homo sapiens 



<400> 712 

agttgaantn cttgctacnn aaaacctttg gcnactngct ctttntgnag gatcccatcg 
attcgcaaag atggtcgtat tactaaaggt gaataaccag cgcggnnngc acgtggagtc 
actggaacat ttgtgcaatg ctggtgggaa tgtcaacccg tgcggccctc tggaataagc 
ctggcagctc ctccaagagt taccgngtga cccancaatt ccactcctag ctccacccac 
aggaattgaa agcaaanacg caaacagatg cctgtncacc aaagttcacg gcagcatnct 
tcgncatagt ggcagcatcc gtcgtcacag cggcatcatc cttcatcata gcggcagcat 
ccgtcgtcac aagcggcagc atccttcgcc acagnggcan gcatctgtcg tcacancggn 
agcatccttc gacaaagcgg cagcatnctt cgtnatagcn gcagcatcct ttgccatanc 
cggcaaggtg gaaaccctgt ccatccactg aggcgtgcat agactaaaca tgggcagtcc 
agcactggaa ttccaagccg tacaacggng nccacngtca aaaangaatg aggaccctga 
ngcacctgng cnganaacaa gaacnngcga nnccaanact tttnagacat tattgcctta 



60 
120 
180 



<400> 710 

gnnnnnnnnn nagngtttga antcctcctt ngaaatcctt tggcnactcg ctctttntgc 
aggatcccat cgattcgaat tcggcacgag gggcaatgca gttataatac tgtgttaatt 
tcagacatct tctggtcctc cgagccttgt atttacatac tagctgaaac tgcaagtgga 
aatgaatgga gctgatgata tttgccttat cctaattttt ctgtgaggag gagaaaaaca 24 0 

cttgtgcttc aaataagcag atgtgaaaac acttctcact aatcaaaatg tttaccacta 300 
ggttatgaga gtctgcctct cataggcagt gaatctgata tgtatactta gtaatataag 
tctatttagt ttgacaaaac cttagagcag aatttttgca gcttagttca ggatgatcac 
tagcaatgcc aaacttcatt ttttattgaa cttggatcca agaaggcctg ctgtgtctat 
ttcagtatag actctcatac caatatattt atgctccaag tcactacacc cagaagtgat 
gcagtggggg aaatgcaaag acaacatcac tgtaagattc acagaatgga tcttttgtaa 
aatattttat attgacttaa ggaaaacctt tcattgggaa ttaattaaat taagtctcta 
atatcctgga agacagtaaa aantnaagcn ggtgntctca antttgaacc cggcnattng 
naatttcatt ataggaattt ctgaaaataa tec 



360 
420 
480 
540 
600 
660 
720 
753 



<400> 711 

naatngctag gctacttgtt etttttgeag gatcccatcg attcgaattc ggcacgagcc 
tacttattgg atgttggctc tttggtgtca tggagatggc tttactgtag gtttgttgtg 
ttgeattact tttcattggg attgaactga gaaataacaa acaagcttta agtgggaaat 
taaaaaaaag aagtaaccta tgtagatcca aacttaaaat gtgagaaatt attgaaattt 
cattttctac aaacttgaaa ttagectget aattgtaaag ttgttttaat aatgetgaca 
aatgtcagtt acgtttgcaa aggagtgtat ggttctaggt atttgectae tgttaaccgt 
tgagaaaaac attgtcaggt tagcaagtct attgaaatag agacctcctt agtttacagc 
aaagaataaa tagctgatga ctggagattg ggactaaggt tttatttatt tatattcttt 
gaaagaaatc ggacagttaa taagtggttt gtggtagagt tgaaggatgt ctgagagatg 540 
gaaagagagt gacaaaggag gagaaggaat agtatttctt ttttagtatt gntttgaaat 600 
taaaactctg ntattttaat atggtaaaga gcaagaattt gggttgggcc gengtgaetc 
aegectataa tcccagcact ttgggaagcc ntggtgggca aatcacctga aattangg 



60 
120 
180 
240 
300 
360 
420 
480 



660 
718 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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agtngaaaaa cccagngcac caacgggaaa 
nantnngttn cccgaaaatg ggggcacntt 
cct 



ccngaccgnc 
nccaaaaagg 



ntgnanccct gnttaacntt 
ggaataaaag gggagaattn 



720 
780 
783 



<210> 713 
<211> 765 
<212> DNA 



<213> Homo sapiens 



<400> 713 



gttgaantcc 
tcgaattcgg 
atgaagagag 
ttttccacag 
ggggagagaa 
cctgtatacc 
tctgcataca 
catccttggg 
tggtaaatcg 
acgcagcaat 
atccccaaag 
gggaaaaaag 
ggccccttgt 



cacgagccca 
aggacatcta 
cctctgacat 
agagacaaag 
gcactttttt 
tcgctgcagg 
gctgtgccgt 
ggtttccaga 
ctgaaatcat 
aagtttttct 
cccacttgnc 
tcaggaaacc 



ttcctttcaa 



atngcttggc 
catgtaccag 
cactttagat 
gcgatttgaa 
agtgctgagc 
gaagagctct 
ccacagagca 
gagttttgct 
gtttggtcac 
ctctagtctt 
aaagtnaaaa 
cacannagga 
acactggggg 



tactcgntct 
gttgagtttg 
gaagagttac 
gacacgtttt 
tccaggttta 
ttccagaaga 
gcttgggttg 
ggcatangtg 
caaaaataca 
gctttccttg 
ggaaaattcc 
agccnggntn 
ancttntttt 



ntntgcagga tcccatcgat 
aagatggatc ccagatagca 
ccaagagagt gaaagctcga 
atggagcaga cattatccaa 
agaatgaata tgtggccgac 
agtgccagaa gagacagtag 
gaagagagaa gatgaaggga 
acagggtgtg tctctgacag 
aaatacaccc aatgaattgg 
tgagcagttg tctttctatg 
tagtggaatt cancccccaa 
ccccttngtt ccggcttaan 
ttttn 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
765 



<210> 714 
<211> 740 
<212> DNA 
<213> Homo sapiens 

<400> 714 

gtttgaannc cttngntttc naatgctngg ctacttgttc tttntgcagg atcccatcga 60 

ttcgccaaaa gcttgtggca aatttgaaat ttctgccatt agggacctta caactggcta 120 

tgatgatagc caacctgata aaaaagctgt tcttcccact agtaaaagca gccaaatgat 180 

caccttcacc tttgctaatg gaggcgtggc caccatgcgc accagtggga cagagcccaa 240 

aatcaagtac tatgcagagc tgtgtgcccc acctgggaac agtgatcctg agcagctgaa 3 00 

gaaggaactg aatgaactgg tcagtgctat tgaagaacat tttttccagc cacagaagta 360 

caatctgcag ccaaaagcag actaaaatag . tccagccttg ggtatacttg catttaccta 420 

caattaagct gggtttaact tgttaagcaa tatttttaag ggccaaatga ttcaaaacat 480 

cacaggtatt tatgtgtttt acaaagacct acattcctca ttgtttcatg tttgaccttt 54 0 

aaggtgaaaa aagaaaatgg ccaaacccaa caaactaaca ttcctactaa aaagttgagc 600 

ttggacatat tttgaatttt tgtaagtgaa agatttttaa actgactaac ttaaaaaaat 660 

agattgtaat tgatgtgcct taatttgcat aaatcataaa tgtatgtcct ctctgtaatt 720 

ggtttaatgt gtgcttgaan 740 

<210> 715 
<211> 708 
<212> DNA 
<213> Homo sapiens 

<400> 715 

tttgcaaatn gcttggctac ttgttctttt tgcaggatcc catcgattcg aattcggcac 60 
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gagggaggct agactcaagc tgtctggaga gtgtgaaaca aaagtgtgtg aagagttgta 120 

actgtgtgac tgagcttgat ggccaagttg aaaatcttca tttggatctg tgctgccttg 180 

ctggtaacca ggaagacctt agtaaggact ctctaggtcc taccaaatca agcaaaattg 240 
aaggagctgg taccagtatc tcagagcctc cgtctcctat cagtccgtat gcttcagaaa 



<210> 716 
<211> 730 
<212> DNA 
<213> Homo sapiens 



ntgcttaatn 



300 



gctgtggaac gctacctctt cctttgagac cttgtggaga agggtctgaa atggtaggca 360 

aagagaatag ttccccagag aataaaaact ggttgttggc catggcagcc aaacggaagg 420 

ctgagaatcc atctccacga agtccgtcat cccagacacc caattccagg agacagagcg 480 

gaaagacatt gccaagcccg gtcaccatca cgcccagctc catgaggaaa atctgcacat 540 

acttocatag aaagtcccag gaggacttct gtggtcctga cactcaacag aattatagat 600 

tctaatctga tgagttactg agctttggtc ccttaaaaca agctgacttg gtccctaaac 660 

cagatgaaaa tccagatgct ctatacttgg ctttaagaac tgctttcn 70 8 



<4UO> 716 

ttgcaaatng ctnggctact tgttcttttt gcaggatccc atcgattcgc tcccatggag 
gtggtgggaa tggcaccgag aagtttgatg acagttatct aatggactag aggttggcaa 
actttctgta aatggccagg tagtaaatag ttctgctttt gaaggcatat ggtctcttgc 
acctactcga ggctgaaagc agctatagac aatacataaa tgaatgagcg tgagtgtgtt 
ccaataagaa aaaaacatgg ctgtttgctt cggccccagg gttgtagctt accagtcctg 
taacagatca cagtttgctc ttttggtcac aaatacttga acccctccct agttcagagc 360 
atttgatacc gtaatattta aagctcactt gtaaaacatc gtttgttgcc tccatccata 420 
gtatctcaaa cagaatgtct ctcccaaata tacctaaatt ccatattctc tgaagcacaa 480 
ccagctattt tcttgacata cttcctaaca caccccacag ttcacaattt gatctgaaaa 540 
cttgttaagg gaggttcttt ggcatgtgat gccataaaaa gagaggtatg ggctctcctt 600 
taaaaaagag acccttttta tgagactcac aataggataa aagagcccat gcctattttt 660 
^f^f"!" ttcactatat aagacatgca tgcctgnaaa atggttttta attagtatna 720 

730 



60 
120 
180 
240 
300 



<210> 717 
<211> 728 
<212> DNA 
<213> Homo sapiens 

<400> 717 

naatngctng gctcttgttc tttttgcagg atccctcgat tcgctgcagt gagattctct 60 

gcaatgactg gcctcagcaa gggggcagct taggaccctg acatcccagg tcactaagcc 120 

acataggata agtaatgggt ggacagaagc gggaaaggag aagggcaggg cacatgttta 180 

aaacttgaac tttctgaggc taagactgga aaaggaatgg tttcagctga tatatttgga 240 

taccagttga ctatttttag gaaaaaaaca caaatggctt ttaaacatca cagtgtgata 300 

cagtctaact cagaattaga gacaggcaaa acagaactcc atcttaaaaa ataaataaat 360 

aaaataaaat aaatgacatc actttggttc agagctctaa aatggaggga ggaagccatt 420 

ctaaaaagga ctccctacat gacctgcaac ttgaaaaaaa attaaaagct ccaaaaaaaa 480 

caatncagga gcttaccttg aaccttttga attgggccaa attgccgatg accactgcat 540 

cctggaaaat tttatttcac cagcactaca acttctcaac agcaccaacc aatttaacta 600 

tggatttttg tactaanocc agttgcctct ttnaaaacaa cttgtcaact ttgtctaatc 660 

accctoagct tttttttaaa aacccctnct ctacccctnt ctcttcagaa caccaaagtg 720 
gncttttn 33 

72 8 
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<210> 718 
<211> 730 
<212> DNA 
<213> Homo sapiens 

<400> 718 

gaantccttn nntttnaaat cnttggctac ttgttctttt tgcaggatcc catcgattcg 6 0 

aattcggcac gatctagata ttgcccaatc gctgcccaca gtgcacatac ctttccacca 12 0 

gtcacatgtg agagggcaga ttttccaaat gctcatcacc acttggcact gtgtggacta 180 

taattttggc cagttaggaa atggcatctc attgttttca tcttaatttg cgtcagcctg 24 0 

attactcatt gaaacttgtg aggttgagaa acttttctta agcttattgg ccattcaagt 300 

ttcctccttt atgaaatggt tgttcatgtc atttgctcat ttttatatta gattgttttt 360 

cttttttcca gctgacttgt aggaactcta catcttatca atattaatca tttatcgaaa 420 

actatttggg tgccattatc ttctcctagt caatgttttt tgtttgtgat atcttttata 480 

atatataagt ttttaatgtt ggcagaagta aagttaatct ttttggctgt gttgtgtgtc 54 0 

ttgtttgatg taaagatagt ttctgtaata gttttgcagt ttgattggtc atctttaggt 600 

cttcaattac aacctgcaca ttcatccctc tatcctcttt cttactctgg ttttctccat 660 

agcacttatc atccaataat atggcatgca cttatttaat ctggtttgca tatatatttt 72 0 

ngctggtacg 730 

<210> 719 
<211> 733 
<212> DNA 
<213> Homo sapiens 

<400> 719 

ttcaaatcgc ttggctactt gttctttntg caggatccct cgattcgctt cagtgcacac 60 

aacaggagag aggagaaaga agaaacgcta gtaattccaa gcactggaat taagttgcct 120 

tcatcagtgt ttgcttcaga gtttgaggaa gatgttggat tgttaaataa agcagctcca 180 

gtttcaggac ctcgactgga ttttgatcct gacattgttg cagctcttga tgatgatttt 240 

gactttgatg atccagataa ttctgcttga ggatgacttt attcttcagg ccaataaggc 300 

aacaggagag gaagagggaa tggatataca gaaatctgag aatgaagatg acagcgagtg 360 

ggaagatgtg gatgatgaga agggagatag caatgatgac tatgactctg caggcctatt 420 

gtcagatgaa gactgtatgt ctgtgcccgg aaaaactcac agagctatag cagatcactt 480 

gttctggagt gaggaaacaa agagtcgctt cacggagtat tcgatgactt nctcagtcat 54 0 

gaggagaaat gaacagcttg accctacatg atgagangtt tgagaaagtt ttatgagcca 600 

tattgatgat gatgaaattg ggagctctgg ataatgccag aatttggaaa ggttctattc 660 

aagtgggaca gcaattcgct ttcnaggaag ttttgaatga ctactattaa agagaangcc 720 

caanaattnt ntt 733 

<210> 720 
<211> 740 
<212> DNA 
<213> Homo sapiens 

<400> 720 

agttnnnttn ntnctnttca aatccttggc tacttgntct ttttgcagga tcccatcgat 60 

tcgaattcgg cacgagaaga gaaggaccta gagattgaga ggcttaagac gaagcaaaaa 120 

gaactggagg ccaagatgtt ggcccagaag gctgaggaaa aggagaacca ttgtcccaca 180 

atgctccggc ccctttcaca tcgcacagtc acaggggcaa agcccctgaa aaaggctgtg 240 

gtgatgcccc tacagctaat tcaggagcag gcagcatccc caaatgccga gatccacatc 300 
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ctgaagaata aaggccggaa gagaaagctg gagtccctgg atgccctaga gcctgaggag 360 

aaggctgagg actgctggga gctacagatc agcccggagc tactggctca tgggcgccaa 420 

aaaatactgg atctgctgaa cgaaggctca gcccgagatc tccgcagtct tcaacgcatt 480 

ggcccgaaga aggcccagct aatcgtgggc tggcgggagc ttcacggncc cttcaccagg 540 

tggaggacct ggaacgcntg gagggcataa cngggaaaca gatggagtcc tttctgaagg 600 

caaacattct gggtctcggc ggccgccanc gctntggcgc cttctgaccg tcgctnctac 66 0 

ttncgncttt tcaaattttt ggnataaccc ccgtgtttgn gtaaaatcca gtttttgttc 720 

cgntaaaaaa aaaaaaaaat 740 

<210> 721 
<211> 736 
<212> DNA 
<213> Homo sapiens 

<400> 721 

nnttnnnttt tnnaaatccc ttggctactt gttctttttg cagggatccc atcgattcgc 60 

atgagtgata ttttggtctg ggtttcctct taagatttta gtttgtctga attaaggaaa 120 

aatgttttta atatacaLLc ttatttcgtc ccaccccucc agaaacaagc tggaaatctt 180 

aactttttgg ggggtctttt ttggtgtttt aatgggccca gaactgtggt ttaaattttt 240 

atgtatgtat tttctttttt gtggagtata aatttaaaaa ctggatttgg gacctaaaat 300 

actcctcagg ttgatgtatt catgaagttt taaaacatct ttagttttca aagtaaactg 360 

gatatgtgga ccttaaagtt attgagttta agctacaaat tgtaacgtca ttactggaca 420 

tgtcagcatc aaccctctca aaatagcttg gtcactttat gaaggggcgt tttaaagttg 480 

ttgtttagca gtgacattta atatggtcca attgcttttc tttttaacgt gacaaaaaga 540 

gaataaggaa caaacactat tgctgccgaa tgccataaca ctgagttgtc aaattgtgat 600 

tgaggaaatg aaaaggttta tactttttaa aaaaaaaaaa cnnaanccaa aaaaccaaaa 660 

cttcaaatgg aataaattat tcatgaagcc cttaaaaaaa aaaaaaaaaa aactcgaacc 720 

tntaaaactn tngngg 735 

<210> 722 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 722 

attnccttgg cttttcaaat ccttggctac tngttctttn tgcaggatcc catcgattcg 60 

aattcggcac gagattatag agattaatct cctttgctcg aagtctattt aaatattagt 120 

cacatctaaa acatactttt acagcaacat ctagactggt gtttgaccaa acaactgggc 180 

atcatagctg acacataaaa ttaaccatca caaccatgtt ctaggcactg ttcctcactg 240 

cctgagaaga caccgttatg tttattaggg tttttgagtt ttatccacag cttttggtta 300 

tctgcaacca tgtctcccac cattaacata gttcacactg agatgaggat tccctattta 360 

acacttggtc ccaacttctt cacagtccat ctggttttgt agagggaaca taactggaca 420 

ttctggtcag gttaggtgag gtcaggcctt caggacgcta ttttcactga gttgctttat 480 

aaggcacatt atgcaaaatt ccatcagctc ttctgttcac tacattcact gttgaaattc 540 

taagagtgag actgctgtct cacaccaaag ccagtgggta ctatcttcag taggcacgca 600 

gcatcatgtt tgtatttgat ccanctagat gacatgtaag agaaaacttt attgnggact 660 

ctgtaaagtg tgacattcgt ttgtgactca atttgctcat gtatttggtc ctggggagtc 720 

attacatagc taactttcag ctgctttcaa t 751 

<210> 723 
<211> 749 
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<212> DNA 

<213> Homo sapiens 



<400> 723 

tttaatncct ttcnntaatc cttngtttcn ngcnctttnt gcaggatccc atcgattcga 60 
tgctagccaa agcctgctgc cagctccata gcctggacct acagcactcc atggtggagt - 120 

ccacagctgt ggtgagcttc ttggaggagg cagggtcccg aatgcgcaag ttgtggctga 180 

cctacagctc ccagacgaca gccatcctgg gcgcactgct gggcagctgc tgcccccagc 24 0 

tccaggtcct ggaggtgagc accggcatca accgtaatag cattcccctt cagctgcctg 300 

tcgaggctct gcanaaaggc tgccctcagc tccagcctgg accttgcccc caggtgctgc 360 

ggctgttgaa cctgatgtgg ctgcccaagc ctccgggacg aggggtggct cccggaccag 420 

gcttcctagc ctagaggagc tctgcctgnc gagctcaacc tgcaactttg tgagcaacga 480 

ggtcctnggc cgnctactcc acggctctcc caacctgcgc ttactggatc ttcgtggctg 54 0 

tgcncgcatc acgccggctg gccttcagga tctgccatgt cgggagctgg agcagcttca 600 

tctgggcctg tatggcacgt cagaccggct gacttttacc aangagggca agnccctttt 660 

gaccagaant ggtgcataca ctgcgaagaa ctggactttg aatggccaag ggttcaattg 720 

agaaagacct ggaacangcc cttgctnct 749 

<210> 724 
<211> 761 
<212> DNA 
<213> Homo sapiens 



<400> 724 

ttnnnnccct ttttaatncc ttctactaat ccttggctct cgntctttct gcaggatccc 60 

atcgattcga attcggcacg agcctcagcc ttctaaaaag ctggggctac acccagctga 120 

agaaattgta actaaagata gattgtttaa agcaaagcaa gaaacttctg aagaaatgga 180 

acaaagtgga gaagcctcag gaaagcccaa cagagagtgt gcaccccaga ttccttgtag 240 

tactcctatt gctactgaaa ggacagttgc acatttgaac actctgaagg accgtcaccc 300 

aggtgatttg tgggcccgca tgcacatctc atccctggaa tatgctgcan gagacattac 360 

ccgaaaaggg agaaaaaaag acaaagctcg agtgagtgaa ctgctccaag gcctctcatt 42 0 

ctctggtgac tcagatgtgg aaaaagataa tgagcctgag atccagcctg ctcaaaagaa 480 

gttaaaggta tcatgtttcc cagaaaagag ttggaccaaa agagacatta aacccaattt 54 0 

tccaagctgg tcagcactgg attctggact tttgaatctc aagagcgaaa agtttgaacc 600 

cagtagagct ttttgaatta ttttttgatg atgaaacatt caacttaatt gtcaatgaaa 660 

ccnataatta tgcttctcag aaaaatgtca gctttggaag tccagttcag gaaaaaaaan 720 

nnnnannaaa aaactcgagc ctntanaact atngtgagtc c 761 



<210> 725 
<211> 760 
<212> DNA 
<213> Homo sapiens 



<400> 725 

tttcnccccn tttttanccc cttnctctaa tccttggntc tngttctttt tgcaggatcc 60 

catcgattcg aattcggcac gaggcggact ctcaggacga aaagagtcaa acctttttgg 120 

gaanttcaga ggaagtaact ggaaagcaag aagatcatgg tataaaggag aaaggggtcc 180 

cagtcagcgg gcaggaggcg aaagagccag agagttggga tgggggcagg ctgggggcag 240 

tgggaagagc gaggagcagg gaagaggaga atgagcatca tgggccttca atgcccgctc 300 

tgatagcccc tgaggactct cctcactgtg acctgtttcc aggtgcctca tatctcgtga 360 

ctcagattcc cgggactcag acagagtcca gggctgagga actgtccccc gcagctctgt 420 
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ctcccttgct agagcccatc agatgctctc accagcccat ttctctactg ggctcctttt 480 

tgactgagga gtcacctgac aaggaaaaac ttctatcagt actttgatat gtcacagttt 540 

catgtttatc cagttcaatg tatttttaaa tttttccttg agacttcttt gactgataga 600 

ttattgtgaa gtgtgttttt aaatttncaa atgtttangg attttcatat ctttcttatg 660 

ctgatttcca attggattcc ttacaatgat ttttgggttt catctgctct tggatgatta 720 
ctatctcttt taaatttggt gtggccaagt tttagggccn 



<210> 728 
<211> 765 
<212> DNA 
<213> Homo sapiens 



760 



<210> 726 
<211> 741 
<212> DNA 
<213> Homo sapiens 

<400> 726 

ttntgccctt tgtntnatcc ttgntcttgc ctttttgcag gatcccatcg attcgaattc 60 

ggcacgagac aagttctatt gagtgctatt cagaatagga acaaggttct aatagaaaaa 120 

gatggcaatt tgaagtagct ataaaattag actaatctac attgcttttc tcctgcagag 180 

tctaatacct tttatgcttt gataattagc agtttgtcta cttggtcact aggaatgaaa 240 

utacac 99 ca acaggctcaa caggtgtaat agcccactta ctcctgaatc tttaagcatt 300 

tgtgcatttg aaaaatgctt ttcgcgatct tcctgctggg attacaggca tgagccactg 360 

tgcctgacct cccatatgta aaagtgtcta aaggtttttt ttggttataa aaggaaaatt 420 

tttgcttaag tttgaaggat aggtaaaatt aaaggacatg ctttctgttt gtgtgatggt 480 

ttttaaaatt tttttttaag atggagttct tgttgcccag gctagaatgc aatggcaaaa 540 

tctcactgca atctcctcct catgggttca agcaattctc ctacttcagc ctcccaagta 600 

gctgggatta caggcatgtg ctaatttggt gtttttaata gagatgaggg ttttccatgt 660 

tggtcangct ggtctcaaac tcctgcctta ngtgatcgcc tcggcctnct aaagtgctgg 720 

aattcaggca tgaancncca t ? 

<210> 727 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 727 

ccttcttccn aangctntgt tgaacncctt tcnnnatcgc gcttgcgctt tgagctagga 60 

taaaaattgg gtaaagggac atttgcttac ctgnntnatg aatcactntt tgaaatgtga 120 

tcttgccata tcatcaagaa acttgttttc tggatgaata ctgggagaat aaaatgagaa 180 

ctctggagtg agctaaattg atcccaatna agtttttctg cttagcagac agaaggtata 240 

attntttgac accctttccc acctggtgcc tatgctaggc ttgtcctgan aacatncctc 300 

agtaacttga tattcacatg acctacagga tgtcccatct gcagggctga gtcagttggg 360 

gaacaccaga ggctacacag tagctattcc tgctactcgg ttaatgagct tggcaggttc 420 

tttgtctcac tgaattctta tcatggaaac agcagcagca gccgctagga aatcttcaag 480 

tgtagnggcc tgtgctaacc cagtggtaaa tcccttagat cccctgctgg tctctggcaa 540 

aactccttga tnttgggtac catgtatant ttgcctttga cntttaacgc tttctacgat 600 

anggtaanca cncntttaat ttangcnctg gancattaac tttctttgca aaggctactt 660 

atngccngnc acaantgcag cctcggacan ancnnangnn atatcctgtt ggccatggct 720 
ntgatgtttg acanccgata ngccttctnc g 



751 
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<400> 728 



tngnntttnt 
cctgcagatt 
ggacatgctg 
cctgaactct 
tacctgtcac 
nccttcttnt 
gtnccncnnt 
acacactnac 
atgggcgtng 
aagccaggtt 
aacttgtgng 
natnnagggc 
ttctnaaaat 



ttaacnttgt 
gcagagcggc 
gagcagatgg 
ggaggagaag 
aatganatta 
tcctncacct 
gttcagtccg 
agagangcca 
tntggccaca 
nnttnataga 
aacantttaa 
taaacaagct 
nttcaatntn 



ttgacgcctt tctgcaggat ccctcgattc gcactggcta 
gagagcccat aggcagcatg tcatccatgg aagtgaacgt 
acctgatgga catatcggac cangaggccc tggacgtctt 
agaacactgt gctgtccccc gccttagggc ctgaatccag 
ccctccaggt tccaaatncc tcagaattaa gagccaancc 
gcaccgactn ggncacccng nacatcanng agggtgggga 
atgaggagga anttcangtg gacactgncc tgnccacatn 
ctcnngatgg tgntnangac agcaactntt aaattgggac 
ctggaatcca nntttggctg tatgcggaat ttcacctgcn 
cgttcttgat tattacataa ttgccaatca tgtggtgagn 
caattaantg tgaagaccgt acaangaatt agttaaangc 
attacttntg annnaantta angnatntaa nntttnctgn 
nngggaacan ttgtaanttt nncnt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
765 



<210> 729 
<211> 743 
<212> DNA 
<213> Homo sapiens 

<400> 729 

tannnnttnc tntanrmttt ctgangccct tntgcaggat cccatcgatt cgaattcggc 60 

acgaggagat ctctgggatg tcagtgaggc tggttgaaga ccagaggtaa actgcagagg 120 

tcaccacccc caccatgtcc caggtgatgt ccagcccact gctggcagga ggccatgctg 180 

tcagcttggc gccttgtgat gagcccagga ggaccctgca cccagcaccc agccccagcc 240 

tgccacccca gtgttcttac tacaccacgg aaggctgggg agcccaggcc ctgatggccc 300 

ccgtgccctg catggggccc cctggccgac tccagcaagc cccacaggtg gaggccaaag 360 

ccacctgctt cctgccgtcc cctggtgaga aggccttggg gaccccagag gaccttgact 420 

cctacattga cttctcactg gagagcctca atcagatgat cctggaactg gaccccacct 480 

tccaactgct tcccccangg actgggggct cccangctga nctggcccag agcaccatgt 540 

caatgagaaa gaaggaggaa tctgaacctt gggtaaggat ttggggcaca gtaccaggaa 600 

gggggcttgg tgccagacct tatgaggaag aaggattttc ctatgtacag agaangggac 660 

cctgtnctgt tgggaagtgc ttgtgcaaac ctaaccaagt tactaacccc tctgntttct 720 

gtgctacaca aaggggataa att 743 

<210> 730 
<211> 744 
<212> DNA 
<213> Homo sapiens 

<400> 730 

tttnttcctt cctctaatcc ttttancgcc tttctgcagg atcccatcga ttcgaattcg 60 

gcacgagggg tcctccaaga gtttggggcg cggacnnnag taccttgcgt gcagttatgt 120 

cggcgtntgt agtgtntgtc atttcgcggt tcttacaaca gtacttgagc tccactccgc 180 

agcgtctgaa gttgctggac gcgtacctgc tgtatatact gctgaccggg gcgctgcagc 240 

acggttactg tctcctcgtg gggaccttcc ccttcaactn ttttctctng ggcttnatct 300 

cttgtgtggn tgagtttnat cctagcggtt tgcctgataa tacngatcaa cccacngaac 360 

aaagcngatt tccaaggcnt ctgcccagag cnagcctttg ntgannttct ctttgccagc 420 

accatcctgc accttgttgt natnancnta ggtgnctgaa tcattctcan ttncntaatt 480 

gangagtang anactaaaag aatgttgact ctttgaatct gctggataag agactngaga 540 

tggcagctta ttggacacat ggattttctt cngatntgca cttactgcta gctntgctan 600 
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ctatgcagga gaaaagccca tagttactgc gtgtnacaac aactntctaa cnaacattca 660 

ttaatccann ngannccttt caangaatgg taancctatg ccnttcaana tactgaactt 720 

nntgccactt ntggcaaaaa aaat 744 

<210> 731 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 73X 

cttattccct ttgnaactna ctctttntca tccctttgtg caggatccca tcgattcgaa 60 

ttcggcacga gtgtccttat ctgaaattca gcgatcttnt tgaataagca tttctctgat 120 

tgtggtatat gcctttaatt ttatttctag agtgacaaat ttttggtttt gacagttttt 180 

ttctagcttt atagtttctt cttggggaga gaatatgtca acctcactcc atcatgctga 24 0 

agtaaatctt catctcttaa ttttatctct caaaaatatc ctaaggattc cctctggagc 300 

ctgataagta attgcagtat ctggtttcta tggttggatg attcaggatt ccaggaataa 360 

tagttacttt ttagacctct aaagaagaag taacaaccac gtaaatgaaa agatgcttct 420 

taaatcatgg agaatcaggg cttagtatca ctgtattttc aaactgtttc agccttactt 480 

tataactgat ttagtatatt tttcttttaa tttcagactt cagtgaagtt ccttatgact 540 

tcccctgaaa ttgcttcctt atcatggggg caaatgaaag taaaaggctc taatacaacc 600 

tataaggact gcaaagtatg gccagggggt agtcngactt gggattggag agaaacagga 660 

actgagcatt ctcctggtgt gcacctgcag atgtgaagga agttgttgag aanggtgtcc 720 

agactcttgt gattggncna nggata 746 

<210> 732 
<211> 756 
<212> DNA 
<213> Homo sapiens 

<400> 732 

ttimnnncnn nnatcctttn gatttnattc ctntntcang tcctttgtgc aggatcccat 60 

cgattcgaat tcggcacgag gtggcccata agttttacct tttaaacatc cggctgcctg 120 

tgaatgagaa gaagaaaatc aatgtgggaa ttggggagat aaaggatatc cggttggtgg 180 

ggatccacca aaatggaggc ttcaccaagg tgtggtttgc catgaagacc ttccttacgc 240 

ccagcatctt catcattatg gtgtggtatt ggaggaggat caccatgatg tcccgacccc 300 

cagtgcttct ggaaaaagtc atctttgccc ttgggatttc catgaccttt atcaatatcc 360 

cagtggaatg gttttccatc gggtttgact ggacctggat gctgctgttt ggtgacatcc 420 

gacagggcat cttctatgcg atgcttctgt ccttctggat catcttctgt ggcgagcaca 480 

tgatggatca ncacgagcgg aaccacatcg canggtattg gaagcaagtc ggacccattg 54 0 

ccgntggctc cttctgcctc ttcatatttg acatgtgtga gaaaggggta caactnacga 600 

atcccttcta cagtatctgg actacagaca ttggaacana gctggccatg gncttcatca 660 

tcgtggctgg aatctgcctc tgcctctact tcctgtttct atgcttnatg gnatttcaag 720 

tgtttcngac atcantggga agcaatccac ctgccn 756 

<210> 733 
<211> 742 
<212> DNA 
<213> Homo sapiens 

<400> 733 

cntatccttt nntttattcc ttnataagnc cttnngcagg atccatcgat tcgaattcgg 60 
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cacgagctca cacctgcttt ggatgcttca agcacctcag ccctctgaac tacaaaacag 120 

aagagcctgc aagtgacaaa ggaagtgagg cagaggccca catgccccca ccgttcacac 180 

cctacgtgcc tcggattctg aacggcttgg cctcggagag gacagcactg tctccgcagc 240 

agcagcagca gcagacctat ggtgccatcc acaacatcag cgggactatc cctggacagt 300 

gcttggcgca gagcgccacg ggcagtgtgg ctgctgcccc ccaggaggcc tgaggctggg 360 

tctcactgct ctgaaaagac acaaccagaa tggcctgggg ctcaggccct tggctgagtg 420 

ggaatgcgtt gggactgccc agctgagcta tcaggtgccc atcttttctg gtcccagcag 480 

tggtgaggag agcacaggca ggcctcgccc ctcccttgct cacccagttt cccctncggc 54 0 

acaagcttcc agctctgcag ctggggtgac atccccagtg gtttgtcgcc aagacatgtg 600 

gtggactttt cgccccccaa actgatgagt nccggagaat atatggagag agagatgtaa 660 

aaaaaaaaaa nnnnnnnnnt nntnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 720 

nnnnnnnnna annnnananc tc 742 

<210> 734 
<211> 749 
<212> DNA 
<213> Homo sapiens 

<400> 734 

nntanaatcc ntttnnctnt aatccctcta ncaaatccct tgggcaggat cccatcgatt 60 

cgaaaattta tagtaatgac aaatgactta tcagtgttca tcatctgaaa gctaagtggt 120 

tcgttcaatc actttttcaa agttgatagt agattgcatg gtttcatgtt tcctcatatt 180 

ggtttattaa ttctatttaa tcaaggaaaa taacttcaga ttccataaag tttcagttta 240 

tttttagttt actactaggt gagatagcac attacatact tttactatca aatattattt 300 

tagcagcttc ccatagtacc aaatgatttg attccctact ctcatttttt aaagcatata 360 

aatatttatg ggcttaaaaa gggggttttt aaaaactgag gatatcagta ataaattgca 420 

gaatattttg caaagctttc ttttggaaag caaacttttg tgcctgccta tatgcaaagt 480 

attttatcag ggacttgaac aaagacctca ctctttttca cttgtcttat gtcgagagaa 540 

aagggtattg gcagccacat tcctaagact ggggaatggt gtgtcccttt aaatttgaag 600 

ataactttan gtaattatng gaactcctca aagaggagaa agtaattttt tncagacatt 660 

ttctcaatct gggnctttca cacactantt tncatagtcg agaatctggt tttacccatt 720 

gggctgngaa tgtccaatat cagtcctgg 7 49 

<210> 735 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 735 

gngntnngnn gttnnnttnt tttnaatnta atccttgtnt naantccttt tgcaggatcc 60 

catcgattcg aattcggcac gagggtgccc atcaccacac ccagctaact tttgtatttt 120 

tagtagagac ggggtttcac catgttggcc aggctggtct tgaactcctg acctcgtgat 180 

ccgcccgcct tggccccgca aagtgctggg attacaagca tgagcccagc gcctggctgt 240 

atctttcatt ttacccaagt cactttaccc aagtaagtaa ttaggggaaa gcctgagtct 300 

tgtaccacct gttcatttgg ggaactgtgg gaaacggagc caacggacct aagtgccctt 360 

tgacagtgag tttcatacca tttcagtagt gtatttcttt cttaatctga ataaaccaga 420 

atgatactct cagcacagaa gaataaaggg agcgagtcat taacgttntc tttttaaacc 480 

tttatgatga cttncttatg aattactgaa cgaacactgg aatgggactc acgtatcctg 540 

aggacatctc tcaactctgg ccttantttc ccctctgtaa aattagggtg ccaactaaat 600 

gatctacaag gtccctttnc aagcgcccgn cattctgtaa ttacatcatg tggaactgna 660 

ttaaacatac accagtgaac tggcangcat tgggaatgta actttcccag taaaatgctt 720 
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tnggtttggt tcaaaataca ctntgaactt cttttcaaag acnggttnng 

<210> 736 
<211> 746 
<212> DNA 
<213> Homo sapiens 

<400> 736 

tttnnctttt attcaaatnc ttgcnggatc ccttgattcg aattcggcac gagggatgnc 
catcgatgct natcnggcac gaggtgatgn cagcttgcaa actggtctac atnncaaact 
gatagtacat tgccatctnc aggaagactt gacggctttg ggattttgtt taaactttta 
taataaggat cctaagactg ttgcctttaa atagcaaanc agcctacctg gaggctaagt 
ctgggcagtg ggctggcccc tggtgtgagc attagaccan ccacagtgcc tgattggtat 
agccttatgt gctttcctac aaaatggaat tggaggccgg gcgcagtggc tcacgcctgt 
aatcccagca ctttgggagg ccaaggtggg tggatcacct gaggtcagga nctcgagacc 
agcctggcca acatggtgaa accccatctc tactaaaaat acaaaaatta gccangtgtg 
atggtgcatg cctgtaatcc cagctcctca gtaggctgag acaggagcat cacttaaa 
cyggangcag angttgcagt gagcccgaga ttgcaccacc gtactnnaac ctgggtgaca 
gagcgagact tatcttatan ataaatagat ngatcttcac ctgggtgaca naacgagact 
tatagataga tagatagata gatggataga tngatngatn gatagataga ttgataaacg 
gaattgggcc ttttgcttta atgaaa 9 

<210> 737 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 737 

ntnnnncttt ttgatcantc ctttnttgga tcccnttgct acttgttctt tttgcaggat 
cccatcgatt cgaattcggc acgaggctga cctacagcag aagctgctgg atgcagaaag 
tgaagacaga ccaaaacaac gctgggagaa tattgccacc attctggaag ccaagtgtgc 
cctgaaatat ttgattggag agctggtctc ctccaaaata caggtcagca aacttgaaag 
cagcctgaaa cagagcaaga ccagctgtgc tgacatgcag aagatgctgt ttgaggaacg 
aaatcatttt gccgagatag agacagagtt acaagctgag ctggtcagaa tggagcaaca 
gcaccaagag aaggtgctgt accttctcag ccagctgcag caaagccaaa tggcagagaa 
gcagttagag gaatcagtca gtgaaaagga acagcagctg ctgagcacac tgaagtgtca 
ggatgaagaa cttgagaaaa tgcgagaagt gtgtgagcaa aatcagcagc ttctccgaga 
gaatgaaatc atcaagcaga aactgaccct tcttcaggta gccagcagac agaaacatct 
tcctaaggat acccttctat ctncagactc ttcttttgaa tatgtcccac ctaagccaaa 
accttntcgt gttaaagaaa agttnctgga caaaacatgg acatngagga tctaaaattt 
ggtcanagca tctgtgaatg agcatganga t 

<210> 738 
<211> 795 
<212> DNA 
<213> Homo sapiens 

<400> 738 

aatccctttg ctttaancct tgtttgaacc cctttggaac tncctctntn tgnaggatcc 
catcgattcg aagagcncan gcaggaagag agagaccctn actgctgggg anttnctgcc 
acactcaagt ccccaaccca ctggaatctc ccctactaca agtgccatgt anaccccttg 



770 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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720 
746 



60 
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180 
240 
300 
360 
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60 
120 
180 
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aaaaggggag 
tcaaccaaaa 
gtagatccag 
tgaaaaaaat 
antctgcagt 
gnganntgtt 
nnacccanct 
gcanangctg 
tnaatatcaa 
gttgaactta 
nggaaaacct 



gggcctaggg 
aganaantan 
acatgattng 
gcnttatntn 
aaacaaantt 
tggcgttnat 
ntagttccct 
nttttcccnn 
acccggtaag 
accnganatt 
tcccc 



agccgacctt gtcatgtacc atcaataaag taccctgtgc 
anaaaactcn agcctctaga actatagtga gtcttattac 
anacattgat gagtntngac aaaccacanc tcgaatgcng 
tgaaanntga natgctatat nnntcattnn ttaccattnt 
tacagcancn nttgnntnga tttcatgtnt caagttcaag 
ntaattcggc ccnacncgng acccttttgc attgggcccn 
nttagngagg ggnaattgcg cnctttggcg taataatngg 
tgtnnaaatt ggtttatcca gtttannaat ttcaacacga 
cnattaaatg gtnaaaaacn ntgnggggng cccttaanga 
aaattgcnnt tncgcnttna atntcccncn ttttaaatcc 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
795 



<210> 739 
<211> 763 
<212> DNA 
<213> Homo sapiens 

<400> 739 

ttnnnnncct catnaatccc ttctttgatc cctcnccnca aaacccttgg enact cgctc 60 

tttntgeagg atcccatcga ttcgaattcg gcacgaggca nccttcgcct cctgggttca 120 

agtgattctc ctccctcaca tcccaagtag ctgggactac aggcacgtgc caccacaccc 180 

agctaattnt tgcattttta gtacaggcag ggcttcatca tgttggccag gctggtctca 240 

aactcctgat ctcaagtnat ctgcccactt tggcctccca aagtgctggc attacaggaa 300 

tggagccacc gcgcccagcc tgatttcttt anntangtct tgtcangaaa natattgant 360 

ctnttgattc ntnaacatgg cnttnggtcg tctttaatnn gnctcatcan tgcctccatg 420 

tgttnttgat gecttngaac tggtattttt aaaatnncaa tttctaattg nnnattatnn 480 

aaacacaatt gggntnnata tattggcatt gtattaatgc aactttccta aactcactag 54 0 

taattctagt agentnantt ggtanattct taaggatttn ctgngtnaat agncatgtca 600 

tctgtgaatn aagecattet ttganccttt tcaaattttg agecttgtat ttcttattct 660 

taccatatca cattggcaaa gacctccagt atganattga ataaangtgg tganagaaaa 720 

caccctncta aaantgctng aattacaggc atgaaccacc ntn 763 

<210> 740 
<211> 765 
<212> DNA 
<213> Homo sapiens 

<400> 740 

tnnnnnnnnn tttttnaacc ntttnttgna tnentctnte aaatcgcttg gctacttgtt 60 

etttttgeag gatcccatcg attegctage ctgggcaata tagtacgacc ctgtctttac 120 

taaaaatgea aaaattaacc acgtatggtg gctcacacct gtagtcctgg ctactgagga 180 

ggctgatgea ggagaatcat ttgaacccag gaggtcaagg ctgcagtgag etatgattge 240 

accactgcaa tccagcctgg acaacacagt gagaccctgc ctcacaaaaa ttatattctg 300 

attttctgag tccatgaaca cattgtccaa atggattttt ctagctcctc caagttacag 360 

atagttccac gcacacacag aactcaccac tctcaaatat tttccccact agtattacta 420 

ttaaattttt caaacatgea aaagatgaaa gaattgetea gtgaacacca tgtacccacc 480 

acctagattc tacaattaac attttaccct actttcttta tcacatatat gtacctatcc 540 

atctatccat tcttccatga atccatcaat tcatctaatt ttttatatat ttcaagttaa 600 

gttgeagata tgtagcttat gtttcacctt aaatgtttct gectggctat tattaactgg 660 

agtgcaatat gtttttggnt cttctttatg gtaaaatcta tgttcagtga aatgcacaag 720 

acttangtat gecattaata gggtttgacg aatagacaaa ccttn 765 
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<210> 741 
<211> 753 
<212> DNA 

<213> Homo sapiens 
<400> 741 

ttnganccnt tnnntnnttn nntnaatgaa gccatttgct acttgntctt tttgcaggat 60 

cccatcgatt cgaggaaggt ggaggggcag gnaacaggac ggacaggccc cgggctctgg 12 0 

cacatcctgg ggaacaaggg accacaagga cgggggcagt ctccagactt cccctgggcg 180 

cttgacccca ggccttgcag gggagagagc cagggcctcc ctcaggtctt tgttcatgct 240 

gttttccctg ccgtggacac cctttcccgc tctccgattc tctaaatcct gccccatctc 300 

ccagatcttg ttcatgtcca agcttttcca ggaagtctta gcagctccca caccgcagag 360 

ctcgagatgt ctccctgact tggtcccaga ccccaactat gtgcaagcat ccacttatgt 420 

gcagagagcc cacctgtact ooctgcgctg tgctgcggag gagaagtgtc tggccagcac 480 

agcctatgcc cctgaggcca ccgactacga tgtgcgggtg ctactgcgct tccccancgc 540 

gtgaagaacc agggcacagc agacttctnc ccaaccggca cggcacacct gggagtggca 600 

caactgccac cagcattacc acagcatgga cgagttcanc cactacgacc tactqcratgc 660 

aaccacaggc aaanaangtg gcccanggcc acaaaggcca atttctgnct ggaggacanc 720 
acctgtgact tnggcaacct naaacgctat gen 



<210> 742 
<211> 767 
<212> DNA 
<213> Homo sapiens 



<210> 743 
<211> 768 
<212> DNA 
<213> Homo sapiens 



753 



60 



<400> 742 

tnganccttt cgnttctnen ctcctaagcc tttgetaett gctctttttg caggatccca 

tegattcgea ggacatggag cagtacctgt ccactggcta ectgeagatt geagagegge 120 

gagageccat aggcagcatg tcatccatgg aagtgaacgt ggacatgctg gagcagatgg ISO 

acctgatgga catateggae caggaggccc tggaegtett cctgaaotct ggaggagaag 240 

agaacactgt gctgtccccc gecttangge ctgaatccag tacctgtcag aatgagatta 300 

ccctccaggt tccaaatccc tcagaattaa gagccaagcc ancttcttct tcctncacct 360 

gcaccgactc nggcacccgg gacatcagtn agggtgggga gtcccccgtt gttcaanccg 420 

atnaggagga agttcaggtg gacactgccc tggccacatc acacactgac aganaggeca 480 

etceggatgg tggtgaggac agncactctt aaattgggac atgggcnttg nctggccaca 540 

ctggaatcca ngtttggctg tatgengaat tncacctgga aaagecaagg ttggtntata 600 

ganggtcttg atttttacnt anttgncaat aatgggttga gnaaacttaa agaaccagtt 660 

taacaataaa atngttaggg accegttnan aaaatggang tctnccttcc atntnaacct 720 
ggannccttn aaacntttnt gngtccnaat tttcgttnca tccannn 



767 



<400> 743 

naancctttc nnncttcgen attcnaanng ntnggaaagc tcantcgctc natagngenn 60 

gggcttcgcg agnnntggga natnacanag gctngttanc ataccngttt ttnactgoan 12 0 

aggnnnccac angcagcatg geccatgnna tgnccatgcc antgatggen ggnggccatg 18 0 

ctgtcagcgg annegcaett gtgaggancc nntntggann cngtanncna canncacccc 240 

cagtctggna cccnagtgtt cttactacac caantgaaac gctggnnagc caagagcccn 300 
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gatggcccac 
aagccanaag 
acctnnacgg 
tgggnccnng 
accatacanc 
ctgnggcatt 
gnactatgtc 
gatntaacnc 



gtnccctgca 
cagctgtttt 
nctacattga 
attccaggan 
cnttgtanna 
agtnnntcaa 
catgnaaggg 
tttganaaac 



tgganccccc tgancngact ccaccagcct atacangngg 
cngccntgcc ctgctgataa tgccttgaag accccatacg 
cantnngact gtgncancct ngatcagatn atcctggaac 
cttnccntca atggacctgg gngcttgtaa tcngttntgg 
gataaaagan ngaggaaatc tgaaaccntn gnaataagat 
ggggaggntn ggtnncaaaa cnctatgagg aagaacgatg 
gaacatntan tgttgganna tgcnatgcaa ncntnnccnt 
tnangcttna caaaggggga aaaanact 



360 
420 
480 
540 
600 
660 
720 
768 



<210> 744 
<211> 757 
<212> DNA 

<213> Homo sapiens 
<400> 744 

tnnnnncnnt tnnnnttnat ncntctctca aatcgcttgg ctacttgttc tttttgcagg 60 

gatcccatcg attcgcttga cctctgtact ttaaaggaaa tcactaacca aattttcaaa 120 

gtttcctttt aaatgcgttt agctagaaat ctatgtattt atccctttcc tattttgcat 180 

tcttctccca ctatttttaa aaactcattt acagtagaaa ccattcttct ttctcccaac 240 

agtatccttt gccaagacca tgagaacagt aaggagcatg ttgttggtca gggtttcaga 3 00 

atacgcgtga tgtcactgag aatgtttgct cacagtcaat aattgtcttt gtggatgtga 360 

taattttgga gatacacttc tggtcagaac tcaggtgaga taatcttgca atactccaaa 420 

tgcagatact ccagccaccc gcaaggttcc aggaaaggac aatgtcctgc gagaaaatca 480 

ggaggcctcc acttcctggg ccacttgaga agttcctggg catgtcacta catgttggtt 540 

gactcagcca tttctcatgc tgntttgttt cttgcggtgg ccacttaacc ccaaagaatg 600 

aanggaggat ccacagtgaa agtgcctgag tttctctatg agaccagatg ctgtcgaaac 660 

caaacatctt ttcctttgct ctatnggaac attttaaggg ttggtttgca caactggttt 720 

tcagactngg aagattacca agtttgggtc ccccctn 757 

<210> 745 
<211> 751 
<212> DNA 
<213> Homo sapiens 

<400> 745 

cttnttnnnt ttnntttgat ncctctacnc aaacccttgg ctactngctc tttntgcagg 60 

atcccatcga ttcgaattcg gcacgaggaa naacagacag gtttcaacat ggatggatct 120 

gaaatgctgt tgaagcatat catttgcata aaaatcaggg acagtttcca aagaattata 180 

tatttttttc agttggctct ctagttagtt tttttgggag taaggacaaa cctggaatag 240 

atagcaaaac tgaaaatcan cagtgctgat ggtggtacat atgtctttcc tttagcttct 300 

cccctgataa ttcccatctg cttttacttc gggtgagcag agggggatgt gtgtgtgcgt 360 

gtgtgtcagt ctgtttgtga gtgtgttaaa ggctacagac cacagttggt ttaaaatgct 420 

tggaacttcc caaactggct ttactttatg tttatacagt gctcagggtt aacgcagtac 480 

atccatgcca ttgctgtggg aggtatcccc ggatgcatgt gttttgagtc tataaatata 540 

gaaaatatat attggtttct ttttccaact taatangttt attaaagcat gaaatgaaag 600 

ggtgcatatc atgcattcaa gntatntcct aatttttggt ctgacagtgc atgtctttgg 660 

agcatgctga aacaanaatn acacaggaat tgantaaccn gaaagaaaca ttgttaaatg 720 

tccaacattt gttatgcatt tntattgggg g 751 



<210> 746 
<211> 760 
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<212> DNA 

<213> Homo sapiens 

<400> 746 

tnnnnntntn nnnnntttnn nttcntnnnn ctttgaancc ctttgctact tgctcttttt 60 

gcaggatccc atcgattcgc tgaaacaaaa gatgtatttc aattaaaaga cttggagaag 120 

attgctccca aagagaaagg ctttactggn tntgtcangt aaaagaagtc cttcaangct 180 

tagttgatga tggtatggtt gactgtgaga ggatcggaac ttctaattat tattgggctt 240 

ttccaagtaa agctcttcat gcaaggaaac ataagttgga ggttctggaa tctnagttgt 300 

ctgagggaag tcaaaagcat gcaagcctac agaaaagcat tgagaaagct aaaattggcc 360 

gatgtgaaac ggaagagcga accangctag caaaagagct ttcttcactt cgagaccaaa 420 

gggaacagct aaaggcagaa gtagaaaaat acaaagactg tgatccgcaa gttgtggaag 480 

aaatacgcca agcaaataaa gtagccaaag aagctgctaa cagatggact gatnacatat 540 

tccaataaaa tcttgggcca aaagaaaatt tgggtttgaa gaaaataaaa ttgatagaac 600 

ttttggaatt ncagaagact ttgactacct ngactaaaat attccatggt ggtgaaagat 660 

tttcaagctt gngaatttgt aaattttnaa ctattatcta actaatgtnc tgaattgccn 72 0 

ttggctgtac tgggttatca ttttattaat ggtaaataaa 760 

<210> 747 
<211> 786 
<212> DNA 
<213> Homo sapiens 

<400> 747 

tnngncttta nnccntttnn attgnnnnnn nttgaaaccc ttggcnactn gctctttntg 60 

caggatccca tcgattcgaa ttcggcacga ggaggctgtg tcaaagaatg aatggaacgc 120 

ctactatgag gaggtgggtg tacgtnctag anggagatcg agtacatgat ccagaagctc 180 

cctgagtggg ccncggatga gcccgtggag aagacgcccc anactcanca ggacgagctc 240 

tacatccact cggagccact gggcgtggtc ctcgtcattg gcacctggaa ctaccccttc 3 00 

aacct caeca tccagcccat ggtgggcgcc atenctgean ggaactcagt ggtcctcaag 360 

ccctcggagc tgagtgagaa catggegage ctgctggcta ccatnatccc ccagtacctg 42 0 

gacaaggatc tgtacccagt aatcaatggg ggtgtccctg agaccaegga getgetnaag 480 

ganaggttcg accatatcct gtneaeggge ageaeggggg tggggaagat catcatgacc 540 

getgntgeca agcacctgac cctgtnacgc tggaactggg aaggaagagt ccctgctacg 600 

tgggacaaat aactgtgaac tggaccttgg nettnetaac attggncttg gggggaaatt 660 

catnaacaag ttngecaana cctgcgtggg cccctgaaat acattctttt nggaccccct 720 

tgnatccaga accccaattg nnngnngaaa acttnaaana aantnncttt naaaannntt 780 
tttnet 



<210> 748 
<211> 722 
<212> DNA 
<213> Homo sapiens 



786 



<400> 748 

tggaactngc tetttntgea ggatcccatc gattcgaatt eggcacgagg aggaagaggc 60 

ctgctccact tgtctgggaa cctgggcagg aggcacagag gaagecaagg cctggagctg 120 

caggtccccc ggcatctctc tctgtcccgg cagcccagga tggcctggtg cccccacctg 180 

ctgeagcagg agccccaagg agtgctagct gagggtggtt gctggggtgg tcctcatgga 240 

cagtgaggtg tgcaagggtg cactgagggt ggtgggaggg gatcacctgg gttccaggcc 300 

atecttgetg agcatctttg agcctgcctt ccggtgggag canaaaaggc cagaccctgc 360 
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tgagttanag 
tggcanagaa 
tgctgaaacc 
aggaaagcct 
ttggccatga 
tanttggaaa 
tg 



gctgctggga 
gtgccatgtt 
cacgagctgn 
gtctttggtt 
tatttgaaaa 
attttttgac 



tccactgttt 
tgcgtngaac 
acantnanga 
agctcgtgtc 
aggggaagga 
cccctnngct 



ccacacancn 
cttgcantct 
gctgtccanc 
ttctgcagga 
tngccnaant 
taattctttt 



ggaaggctgc 
tncanctggg 
ttgcttggct 
aaaaaaaaag 
ttgtttncca 
gcaanaacta 



tgggaacagg 
gactggtnct 
cactgngacc 
gatgtgtcat 
tttattccag 
ctggggggtn 



420 
480 
540 
600 
660 
720 
722 



<210> 749 
<211> 821 
<212> DNA 
<213> Homo sapiens 



<400> 749 

tttnaanncc cttgctactn gttctttttg caggatccca tcgattcgtn gacatagaaa 60 

acatacagta agaatatggt attataatct tacggggacc actgtcaaat cgcggtctgt 120 

ctttgaaaag ttgtnatggc ggcgcatgac tataaatacc ctagctggtt agcatttaca 180 

ttccttgcca gggagtttga aatttatnct nggcgggctg nctttaggnt ttaggtagag 24 0 

ttaaagaggt aaagcacatg tttgccacaa cccaggaaag tatttttaag aaagatttgg 300 

attttcctac ctttagagat ctaaaaaaaa tttaatataa aaaatcattt tgagntggtg 360 

tttattacta gttcagaatg agtggctgct gaagggggcc cccttgnnat tttcattata 420 

acccaatttt ncactttatt ttgaactctt aagtcataaa tgtataatga ctttatgaat 480 

tagcacaggn taagttgaca ctttgaaact ggccatttct gnattacact atcaaatagg 540 

aaacattgga aagatnggga aaaaaaattc ttattttaaa atggcttaga aaagttttca 600 

agattacttt ggaaaattct aaacnttnct ttctgngttc caaaactttg gaaaatatgg 660 

tagatnggac ctcattgcca tttaagactg gttttcaaaa gctttccctc aacatttttt 72 0 

aaaggtgtgg anttttccct ttttaaatat tccataattt aantttcctt ttnaaaggcc 780 

nctnnttttc ccaaacccat ngncttttgg ggnaaatccc c 821 



<210> 750 
<211> 770 
<212> DNA 
<213> Homo sapiens 



<400> 750 

gntttnnnnn nnctttnttn nntgnctntt tctaagagct tngcnnatgc tnggtcggca 60 

cgaggcaaca tttgtctaca actctactgt aaaattggaa atgcttttcc acagaaaaac 120 

ctctcaaaat gctgaatgca aaagttggga tcacagaaac attgtgccta tttttggtct 180 

gctggaaact gtatttntac aaggtaatcc ctgttctcaa tatagttcct gtcttgccac 24 0 

tggcggtttt cttgtagcat ttttctagtt ctgagattgc tactacccaa agtattcatt 300 

tctttcttac tggggtgtcc tctgtcttca cagcctgctt ctggattgta ggttttttcc 360 

tttctttctg ttgagatatt tatggcattt gatagagtca aaccagatgt attgcagccg 420 

gacatactta tgtggcttca gatgtgtaaa ataagtaact tcctatcttt gtctgtctag 480 

ctcaagagtt gactgtggac gaggaatgcc tgtattgatt cattaatgta ataactattt 54 0 

actgactgcc taccatgtac aaccagaaac acagttccta acctcatgaa cttaccatgt 600 

aacatgggaa gacaagccta agttcttatt tggntggnaa ttgcgataac gctcacagaa 660 

caaattcccg attcctacga acccatgtat aggggggaaa tatttaaggt cccatttaat 720 

actgacattn gcccnccccc ctnntatttt aagctgagaa tctgaaggnn 770 



<210> 751 
<211> 774 
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<212> DNA 

<213> Homo sapiens 

<400> 751 

cgttnnnttt ccncctttga agcocttttt gcaggacttt cnaatncttg gtagacttta 60 

tgtcagttct gtgtagactt tatgtcagtt tttgtcatta tttgaaaatc tattctgaca 120 

actttttaat tcctttgatc ttataagtta aagctgtaac aactgaaatt gcatggatca 180 

agtaagcata gttttatcca gggagacngc tcnnnggaag ccatagaatt gctctggtca 240 

aaaccaagca caccatagcc ttaactgaat atttaggaaa tctgcctaat ctgcttatat 300 

ttggtgtttg ttttttgact gttgggcttt gggaagatgt tatttatgac caatatctgc 360 

cagtaacgct gtttatctca cttgctttga aagccaatgg gggaaaaaaa tccatgaaaa 420 

aaaaaagatt gataaagtag atgattttgt ttgtatccct acccatctcc tggcagccct 480 

actgagtgaa attgggatac atttggctgt cagaaattat accgagtcta ctgggtataa 540 



600 



catgtctcac ttggaaagct agtcctttta aatgggtgcc aaaggtcaac tgtnatgaga 
taattatccc tgcctgntgt ccatgtcaga cttttgagct gatcctgaat aataaagcct 660 
tttaccttat ctggaaaaaa aaaacattnt anancaaaaa aaaactnnga gccctttana 720 
actnttagng agnccntttt ccgtagaatc ccngacntgg ntaaggaanc nnnc 



774 



<210> 752 
<211> 778 
<212> DNA 
<213> Homo sapiens 

<400> 752 

gntttgaann ccnttgtttc gnatcctttt tgnaggactc tgaagncctt tgttcggcnc 60 

gagaagaaac tctgcctcag aaaatgttta cagcttccag tggaatcaaa cataccatga 12 0 

ccncaattta tccaagttct aacacattag tagaaatgac tcttggtatg aagaaattaa 180 

aggaagagat ggaaggggtg gttaaagacn ttgctgaaaa taaccacatt ttagaaaggt 240 

ttggctcttt aaccatggat ggtggccttc gcaacgttga ctgtctttag ctttctaata 300 

gaagtttaag aaaagtttcc gtttgcacaa gaaaataacg cttgggcatt aaatgaatgc 360 

ctttatagat agtcacttgt ttctacaatt cagtatttga tgtggtcgtg taaatatgta 420 

caatattgta aatacataaa aaatatacaa atttttggct gctgtgaaga tgtaatttta 480 

tcttttaaca tttataatta tatgaggaaa tttgacctca gtgatcacga gaagaaagcc 540 

atgaccgacc aatatgttga catactgatc ctctactctg agtggggcta aataagttat 600 

tttctctgac cgcctactgg gaaatatttt taagtggaac caaaataggc atcccttacc 660 

aaatcaagga agactgactt ggacaccgtt tggaaaatgg gtaaaaacgg tggnttactg 720 

gtganttggg gagcnagaac cggacccact ggtatactgg ggantaacaa tttttttc 778 

<210> 753 
<211> 775 
<212> DNA 
<213> Homo sapiens 



<400> 753 

gcttttgaaa cccttttgtt aacgcctttc tgcatgatct tctcgtcctt gaaagggccc 
taaaagagat gaacaatacc gtatcatgtg gtttgaatta gaaacccttg tcagagccca 
tatcaacaac tcagagaaac atcaaagagt cttggaatgt ctgatggcat gcaggagcaa 
acccccagaa gaggaagaac gaaaganacg cggctgaaag agggaagaca aagaggacaa 240 
gtcagagaaa gcagtgaaag attatgaaca ggaaaagtct tggcaagact cagagagatt 300 
aaaaggaatc ttagaacgtg gaaaagaaga attggctgaa gctgagatta taaaagattc 
gcctgattcc ccagaacctn caaacaaaaa accccttgtt gaaatggatg aaactccaca 



60 
120 
180 



360 
420 
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agtggaaaaa tcaaaagggc 
tgccaattcc agaaaacatc 
ctgaactata tcaacatctt 
ccagccggca gtgaagagtg 
gtgcgggaat tcgatatnag 
ctggntttta catttttnaa 



cagtgtcgtt attatccttg 
aggaatttgc tggaccgttt 
aaagaggaaa atgggatgga 
acttgangaa ctaaatttta 
tacttttacc agcaagatgg 
attttttcag tgnccttttt 



tggagtaata 
gaactctgtt 
gacaacagaa 
gcatattgca 
natngttatg 
tggtcctaaa 



gaatcaatac 
aataacagag 
aatggaaaag 
aaaatatttt 
tttgcctgga 
ttatc 



480 
540 
600 
660 
720 
775 



<210> 754 

<211> 1032 

<212> DNA 

<213> Homo sapiens 

<400> 754 

ggnntttttc ccaaaaaaaa ggggccccct nggggntttt tncncaanng gncccctttt 6 0 

tctttgncca gggnaacntt ttttgngaaa aganccccct ttttggatnn accggggccc 12 0 

cccggaaggt tccnaaattt tnagggttna aacccaaatc cttggggaaa aaaaaaaaac 18 0 

ccagggccnt ntntggggnc cccctngggg gggtngggaa aaaaaaaaaa gggggaatgg 24 0 

cccccaaaaa aaaatrmggg gcccctnggg ggaaaaaaaa gggaaagccc aggtngggaa 300 

nggaaagggg gaaggntccc ccggggggaa aggaaatggg tgggtnggna atggcccaat 360 

ggttggaaaa ggcccaaacc aatttgggnt ntaaaacaat ttcaacctgg gggggtcctg 42 0 

gcccanaaaa aatgcngggc acccncgngg ggtctggctt aagaattggt tacaagggca 480 

aagggaaagg gaagagttct agagataaag aactatatgc ttggatgaag tgtgtgaagg 54 0 

gacagcctca tgatcacaaa catttaatgc caacccaaat tatacctggt tctgttttga 600 

cagatcttct agatgccatg cacactctta gggaaaaata tggtattaaa tcccattgnc 66 0 

attggactaa caaacagaat ttacaagttg gaaattttcc tacaatgaat ggtgtatctc 72 0 

aagttttaca gaatgntctt aatcacagna ataaaatttc tctgtgcatg cctgagtctt 780 

cagcagcaaa aatactcctc cgaagtctga gaaaaatggn ggcagcagcc caagaagagt 840 

gatgtaggca cagataacna aggntaacct cctccagaat ccccagtcac cactgcactg 900 

gttaagcaga acttngcagg agcaaaaaag cccngangan ggaaaaaaaa aannaaaaaa 960 

aactcggagc cctcttagaa ctatangggg ggccgnnnta ccgnangatc cccgacctga 1020 



<210> 755 
<211> 798 
<212> DNA 
<213> Homo sapiens 

<400> 755 

ngnnnnnttt nncccnacna aatccctttt ttgaagcctt ctantgnctt catcgtnctg 60 

gtaaattggn tgaattattg tattgaagct tgagctgtat tttnaagtaa tttnggttnc 120 

ccctaagatg ttattatgtt agggacataa cacttttggg aggttgttgt gggagatggt 180 

tgatttaggt tttcaaaagc tagaaataaa atttacatnn ccccggntnn cataaaattc 240 

tgctctaatt gggtggaagg tgctgtatct aacttgtgtt cctnctaagg ttatgtccta 300 

ataactattc ttttaggagt atacttctac tttatagaag gttgcttttt ctttttaatt 360 

ttntctaaca aagaaaagaa tnaagtattt attaataaag aaccagaaag cacttgaaac 420 

tgatgttttt aaatgggctc acttanggta gatttattta tctcattaac ttaaaaacag 480 

ctatgtgnat tgaaataagt cacaacagaa cttgaacacc agggtgggtg tctgagcaat 540 

cccctttctt atggggaaaa acaaatggtt cttgtttgaa cangaaggta tcattgcagt 600 

cngcattcac ccgtgtataa ttgnnatata agntgnataa tatgctcgta aaggctnaag 660 

gtnagctgga tctggatgcc ctttnaccaa ttangatttt aacttttaan aataaaattt 720 

naaanctaat tgncnaaata aaaaaaatan naaacttcgg ncctctacaa nttntagatg 780 



anaggaaccc 



cc 



1032 
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ngtcgattnn cgnncanc 19Q 

<210> 756 
<211> 834 
<212> DNA 
<213> Homo sapiens 

<400> 756 

tttgaaaccc ntttnttnaa gcctttttaa tgactttanc gncctttatt cggcacgagg 60 

tccttcagct ggtagcttnc attcgnantt nnanatanta tntgtgcatg cncnnttgaa 120 

tttttgtgga agaacagant gcagaagaag gcnaggaaag ccgaagagan tnntncggca 18 0 

ncagaagctt aaagnaggcc aaactggtgg tgcnctttcc tcggnacaga agctggatga 24 0 

ctatggccaa tttggagaaa nagctccagg agatggaggc acggttcgag aaggagtttg 300 

nagatggatc ggatgaaaat gaaantggaa gaacatganc tcaaagatga ngatggatgg 360 

taangacagt gatgaggncc gaagacnctg agctctatga tgacctttta ctgnccanca 42 0 

tgtgacaaat cgtnaanaac agtaaaggcc atgaanaatc acntagaagt caaangaaag 480 

cnnttgggaa aaatggnggn nctttgntaa aaccacnagc tgganggang gaagaannna 540 

aaatttttta agnacctcaa attgattgaa aaatncatta tgatgacaat tcctgnanga 600 

ataaattggn agatgcncta naancaaaan gcnttttttn antnnaaana nacaaannnt 660 

nnagcctntt ngaacntata gtnnannctn cntttanctn tntatcccgg actttnttnt 720 

ggataccntt gactnagctt ttggacaaaa ncncnacttt gtattncatt ngnnaaaaaa 780 

atgcntttat ttttcgnaaa tttggtgaat ncntaattng ntnntattnn nnnc 834 

<210> 757 

<211> 1062 

<212> DNA 

<213> Homo sapiens 

<400> 757 

tttttccaaa aaaatcnccc ccnttttttg gccttnaana nanngggccc cctttttttt 60 

gggccagggg aatnccccca atnccggaat tttccggggt ntttgggggg nttggaaagg 120 

gccccttggg gaaagggncn tttccnaagn aaaggggtng gaaaattttn taaatggcct 180 

tttngggggg aaaaagcccc ctnggaatnc ccccaaaaaa cccttgggaa aaagggggga 240 

aaaagggggg aacctttngg gnaatccttn cccnttnaat aatttggggn aattaaancc 300 

ctggtttggg aaagggaaaa gggttggtct tggtcttggg ggaanggaat tgggggccaa 36 0 

nttaaaatgg aaggtttggc canaatnggc cncttcgggg gcttnttcaa aagccaagcc 420 

tttgggancc ctgcttcatt tttngggccc tttnctgcca aggaanccca acccttaact 480 

tancaggaaa anggagatga aaggccttct tccaaggaag gtaaggtcct ttggctgccc 540 

enact taaat gctttttgaa antctcttag atgtggnaaa tattttttcc gaaccttgaa 600 

atcaactngg tagaatttca attggaagca taatccattg taaaatatat tttagttgat 660 

atttggtaaa atgccttttt tggtggtgtg gttngaatcc tgggtttccc aagaatcttg 720 

natttcaaat ggtttaacaa angggaagga aaggganctt ttcccttaac cttccctttt 780 

tgaccaggaa agattttnaa aagtaccttt ctttttaagg aaaaaaaaaa attaaatttt 840 

gaagaaaaat tgggatttgg attttanaaa aaangggaaa aaaaatatna ntattnatan 900 

ntennannat nnttnatnnt ctanntantt nctntnnnta ntnctnntnt ntnnannnna 960 

nannnannaa ataaatantc nnncatnett anctacanat neenntcttn nttntannac 1020 
tttnannnta nntatctaan tctntcccta ttntaccctn nc 



1062 



<210> 758 
<211> 845 
<212> DNA 
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<213> Homo sapiens 



<400> 758 



aaancccttn 
tagcgtcggn 
tntgcttacg 
gaaggtgatt 
tgntttggac 
tgtgttcctg 
gtccacggca 
natcaagcct 
ggattgggtg 
aaaaactttt 
tgggcctatt 
aaantcttnn 
ntaacnttta 
tttaaaattc 
tgctn 



naaaaaaaan 
agccttttta 
attaaaggaa 
ntntngaaaa 



tttnaaatcc 
tccgcntggg 
ggcaaaaacc 
gcctttgggg 
agaantggga 
tcantcacgg 
tgcccaattg 
aatgtgggag 
ggantggang 
gggtttttaa 



tttttanang attcatcgat tcgaattcgg nacgaggcgc 
cccttgcggt gcgctgnggg caggcggtga ggcttacgcn 
tgcacacgca ccanttcccg tnnccgttgt ccaacaacca 
aancttctan gncaacnacn tgaacntatg gacagtgcgc 
acnttnaggn tgntgtgcgc ttcnagcatn tgggcacctt 
gtgagcanta tggaagcccc atccgtgggg cagcatgaag 
caacacgcac aaatacttgg aangccatgg aangcatntt 
cccttttgca agtcacgaat taactctnaa nngtntggat 
gttgcaagtt ngggccnttt tgaaaggcca ctttttggna 
ngggttcntc aaaatgccct ttgnnaattn aaagaaatgt 
atnatacttt atntaatctn nataataata nttantaata 
aaanttttta atgaanctct ttattttanc gttanantnc 
taacaatttg ttgaantttt ggtataaana ncccccantt 
aaaatncntt tattttggta aaaatttgng gaatcnnttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
845 



<210> 759 
<211> 947 
<212> DNA 
<213> Homo sapiens 

<400> 759 

tngggggggg ccccnanttt ggggccccaa acccttnggg gaaacccccc ttnnnnnttt 60 

ttnccntttt gggggggaaa ngcccccccc caaangnaaa aacccntttt nnnnaatttn 120 

ngggnanggg ntntggggnc ccnttaaccc caangggggg gggttttnan cctgggggnn 180 

naaaatnggg ggaanaantn nnnaatgggn antcccttna angggaaaaa naatttnncc 240 

ttaaggnnat gggncattaa tnttnatccc tantggattn caatttcatt cgnattaaag 300 

gcttttactg gnataatcct tnncggcccg cnctggtagt ttaaagtgcc canaanttga 360 

atgggaaatn acgggttttg aaaatcgcac aaagcagtgc cnggcacnga ggngtcacgc 420 

cngtaatncc agcattttgg gaggcctgag gcangcggat cacganggca anagagtcca 480 

gaccattnct ggctaacacn gggaaacccc gggnctaata aaaaatcaaa aattaggntg 54 0 

gacatggtgg cacgtgccng taatcncagc tacttangga agctggatgc aggaagaatt 600 

gcgtgnnanc cnggccccng tggaangntg cattgatacg aagaaccgtg ccaaatgaan 660 

ttanannctg ggcngaannn gagcggaaaa agccctnttt aaaaaaaaan gggantggaa 72 0 

aaantggtgc canagncatn nggggaaaaa attttnnnnt tnnttnancg gttttnanct 780 

tgnggaaggc cntctttaat nttggggaaa aggcactttt gggntnggtt ttggaaaacg 84 0 

nntggctttt ccctttnaaa agggaaaaan ggnnttaanc ccctgaaaaa ngngcngnnt 900 

tttaaanggg gnnnnaaaca nggggncttt ggaancccca nnaaacc 947 

<210> 760 
<211> 759 
<212> DNA 
<213> Homo sapiens 

<400> 760 

gnntttctaa tgcttgtnnn nngcntttnt gcaggatccc atcgattcga attcggcacg 60 

agaagatatg cagagatatt ccaggatctt ttagctttgg tgcggtctcc tggagacagt 120 

gttattcgcc aacagtgtgt tgaatatgtc acatccattt tgcagtctct ctgtgatcan 180 
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gacattgcac ttatcttacc ggctcttctg aagggtctat ttctgaactg gagcagctct 24 0 

ccaattctct accaaataaa gaattgatga cctcaatctg tgactgtctg ttggctacgc 300 

tagctaactc tgagagcagt tacaactgtt tactgacatg tgtcagaaca atgatgtttc 360 

ttgcanagca tgattatgga ttatttcatt taaaaagttc tttaaagaaa aacagtagtg 420 

ctctgcatag tttactgaaa cgagtggtca gcacatttag taaggacaca ggagagcttg 480 

catcttcatt tttagaattt atgagacaaa ttcttaactc tgacacaatt ggatgctgtg 54 0 

gagatgataa tggtctcatg gaagtanaag gagctcatac atcacggacg atgagtatta 600 

atgctgcaga gttaaaacag cttctacaaa gccaaagaag aaagtncaga aaaatttgtn 660 

ccttgaacta gagaaacttg ntntggaaca tttcaaaaga tgaatgacaa tctggattcn 72 0 

ttggtngaca gtgtaatttg gactttaacc ngatgctcg 759 

<210> 761 
<211> 752 
<212> DNA 
<213> Homo sapiens 

<400> 761 

cccnactaaa cctttgcnaa ngccnttnnt gctgatccca tcgattcgca ggcctggact 60 

tcgcccccag gcctaggacc gcggtgggtn ttaaccctgc tnctgcccca acagggactc 120 

caatcaatcg gagttctccc cttgccggag ctgcccttca cctttggggc ccgagacagt 180 

cataagggat ggacttacnt ttcttgcagg gaaaaaggtg gacagccgtg tttcttaagg 24 0 

atgctgaggg catggggcca ggaccagggg agaggcacag ctccttcctg agcagcctct 300 

caccactgcc acaaggctcc ctaatgctgg tctctgctcc actccccggc ttcccgtgag 360 

gcangaggca gagccacagc caaggccctg accacttctg tgccagttgt ctaagcagag 420 

cgcctcaggg acgctggaaa tgccttaagg atagaggctg ggcatcacat caaatgggac 480 

tgtggtgttt ggtgaaaacc ttcctgagga tctggattca ggaccctcca tgactggcct 540 

atttactggt tacagctggc cagtgcanan ctgctgctct tttacctttt taggcccctg 600 

taacttncca cctttaaact gcccaanaag catgcctntt ccacaggaag aagggagcag 660 

acagggaaat ctgcctacca anaagggtgt tgtgtgtctt tgtgcccaca cgtggtggct 720 

ggggaatgcc tggatggtgc cgtggntgat ct 752 

<210> 762 

<211> 1032 

<212> DNA 

<213> Homo sapiens 

<400> 762 

ttctaatgct tggaaacgcn ttgatgnang atnccatcga ttcgaattcg gcacgagggc 60 

aagtggtagt ggcgcttntc gggtgntgtg cttcacgttt tggtctaaag gncgagactg 120 

ttgtggcnac ngngnaantn tacnggaang gnttaaantn tnnntgnagt nggaanaatt 180 

cnatcngaan gaanttgggg gggntagnnn nggttanatn attgatgaat ggnttcaana 240 

tngnaaantt tatnancgan atgnnatant tnnaaangan gaccaactgg gntnanatgg 300 

agnannnatn aannggntaa ncnatanana tantncattt ggtanganaa tngangaagg 360 

attntcaaat agncatgtng gangatgaac ntnnaggnnn nagaatattt ggataaaatt 420 

ggtantatga agatntggnn taataatacc nanaaatnnn nnantttnat nanngangaa 480 

ntagganttn atgnctatgn ggatannntn nanntatnat agngataaan tatgatactg 540 

tttannntat ntnganttag tnattnaatg ntcttgtnan aanttatttt ncgntagtta 600 

gntagnnnta tnnactttgg naancanana tgtaattctc tctanacggg aatntttnta 660 

tnntnnntat caagaggtnt ntnnattgna aatantatac nnttgnanaa antatatcna 720 

tanaanaaan ggnnattatt ntatatganc aaanaaaaaa ntattgngga nntanattat 780 

ctctcatnat ngattatncn gtantgtata atggnnnata antatgtnnn tntaanataa 840 
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atggatataa gtnttatant atgcncntna aggnggtcng anaantatgt aattatattn 
angctanata cnatnnanat gtntnactaa atatngntgt gaaangtntg cgnggnaaaa 
tntgttanta ntnaaacang gtataganat atanatgngn ngaatatcta ctatntgtan 
atacttatan ca 



900 
960 
1020 
1032 



<210> 763 
<211> 817 
<212> DNA 
<213> Homo sapiens 

<400> 763 

aanncccttn tttctaatnc ttggctactc gtnctttctg caggatccca tcgattcgaa 60 

tttcggcacg aggggaggga cccttggggn caggttgtgg gtagccagtt gcagtctgtg 120 

gcctccctca gaggtttgga gtcgggcgtg gcatgctgct gttggcctct ttccgaggga 180 

gtgccatcca ctccctgtcc caccgctnnc cctngtgagg acagtgaggg cagtgctacg 24 0 

tggtggggag gtgtgtgtga agccacggaa gggcttcaca gggcaaatgc caaggccagt 300 

gggccccgga cagagtnagg ctccctgggc ggncttgtgt cttggtggcc ctgatcatcc 360 

tgccaatgca naaagccagc aggcaagaga cccctactcc ctttaaggac cattagcata 420 

aacaaaccat tgngttgaat gcaatgatcc aggtgcactt tnagggtaca agctggactn 480 

gttggaacag gattacatgg aaaannggaa angggggcan gctgtctctt gggacatnag 54 0 

taatgtcttt ttacccantt gncactctng aanttcaaan ttggncatgt tttctggggc 600 

ctnctngnaa aagcagtttt ttcaccncat natgaagaaa aaacttgttg gcttgganng 660 

tanngggatt nttgntnana cttnccctaa anggntncct ttnggggcat ttntgaaggn 720 

taaataatgg gggatacctt tttaannttc cttgcagatt taaaaatgtt ccttaaanga 780 

nncctcaatg nttnggtctt nttccaaaaa acnattc 817 

<210> 764 
<211> 777 
<212> DNA 
<213> Homo sapiens 

<400> 764 

taatgcttgg ntctcgnttt tntgcaggat cccatcgatt cgaattcggc acgaggtcca 60 

cggtgctgaa catcatcatc tttgaagact gtaggaacca gtggtctatg tcccgaccac 120 

tacttggctt gatattgctt aatgaaaagt atttttctga cctaagaaac agtattgtga 180 

acagccagcc acccgcgaag cancnggcca tgcacctgtg ttttgagaac ctgatggaag 240 

gcatcgagcg aaatcttctt acgaaaaaca gagacaggtt cacccagaac ctgtcagcat 300 

tccgtcgaga agtcaacgac tcaatgaaga attccactta tggcgtgaat agcaatgaca 360 

tgatgagctg acacctnctt ggactctacc tgtacagagc agcgtccctt tggtttggcc 420 

cagaggggcg aacaattgca agggagaggg cctggctgat cctggctctt ttctccaggg 480 

gtgtggggaa aaatggcaaa ggtcaactag ctgcttcccc aagggaatag gggtgtgagt 540 

acactcacta nggggcaagg cgctgcttgg ttcctggggg gactgggtgg gaaagggtgg 600 

tgnganggag ataaagagat tcaaactgag actccagtct ttccttctgg gggccaccca 660 

aagttgggga gnaaccccct antggtncct gccaacaacc ttgccttggg attaaacatt 720 

ntncattttt ttcantaana tttttgaaca aagggttant attgnctnaa gtttann 777 

<210> 765 
<211> 774 
<212> DNA 
<213> Homo sapiens 
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<400> 765 

ntttctaatg cttggctctc gntttgatgc angatcccat cgattcggga aatgcaagtc 60 

aaaacagctt tgtaggtctc agagtttgct tttaagaagt agtacaagaa ggaatagtta 120 

tatcaataca ccagtggctg aaattatcat gaaaccaaat gttggacaag gcagcacaag 180 

tgtgcaaaca gctatggarm gtgaactcgg agagtctagt gccacaatca ataaaagact 24 0 

ctgcaaaagt acaatagaac tttcagaaaa ttctttactt ccagcttctt ctatgttgac 300 

tggcacacaa agcttgctgc aacctcattt agagagggtt gccatcgatg ctctacagtt 360 

atgttgtttg ttacttcccc caccaaatcg tagaaagctt caacttttaa tgcgtatgat 420 

ttcccgaatg agtcaaaatg ttgatatgcc caaacttcat gatgcaatgg gtacgaggtc 480 

actgatgata catacctttt ctcgatgtgt gttatgctgt gctgaagaag tggatcttga 540 

tgagcttctt gctggaagat tagtttcttt cttaatggat catcatcagg aaattcttca 600 

agtaccctct tacttacaga ctgcagtgga aaaacatctt gactacttaa aaaaanggga 660 

catatttgaa aaatcctggg agaanggact atttggctnc ttttgccaac ttacttcata 720 

ctggnaagcc agattantng ctcaaggaag ttttgatgag ccaaaaaagt tttn 774 

<210> 766 
<211> 779 
<212> DNA 

<213> Homo sapiens 
<400> 766 

ttnnncgctn ntgaanaccc cttctcctna aatccttttt aantnccttg ctgnntgatc 60 

ccatcgattc gcgaaattcg gtggcgccac gtccgcccgt cttngccttc tgcatngcgg 12 0 

cttcggcggc ttccacctag acacctaaca gtcgcggagc cggccgcgtc gtgagggggt 180 

cggcacgggg agtcgggcgg tcttgtgcat cttggctacc tgcgggtcga agatgtcgga 240 

catcggagac tggttcagga gcatcccggc gatcacgcgc tattggttcg ccgccaccgt 300 

cgccgtgccc ttggtcggca aactcggcct catcagcccg gcctacctct tcctctggcc 360 

cgaagccttc ctttatcgct ttcagatttg gaggccaatc actgccacct tttatttccc 420 

tgtgggtcca ggaactggat ttctttattt ggtcaattta tatttcttat atcagtattc 480 

tacgcgactt gaaacaggag cttttgatgg gaggccagca gactatttat tcatgctcct 540 

ctttaactgg atttgcatcg tgattactgg cttagcaaat ggatatgcaa gttgctgatg 600 

attcctctga tcatgtcagt actttatgtc tgggcccanc tgaacagaga catgattgna 660 

tcatttttgg tttggaacac gaatttaagg cctgctattt accctggggt atccttggat 720 

tcaactatat catcggangc tcngtaatca atgagctaat tggnaaattn ggtggacac 779 

<210> 767 
<211> 799 
<212> DNA 
<213> Homo sapiens 

<400> 767 

gnnnnnnttn cccgcctttn gaaancccct tctttctaat gcttgttcaa cgcctttgct 60 

gcaggatccc atcgattcgt ggatactgac aatggtggca ggcatttcaa gccttttaaa 12 0 

ttagtacttt ttgtcgnctt gcttattaaa attttgttaa ttttagcaaa gaccaattgt 18 0 

tgtgataaac tggtgttttt nggatgcttc aagcacacgt taaccaatcn gccaatnccc 240 

ctttnggttc ctcccattgn tctaaaatag gactttcata ttattaaaac ctcaaaagat 300 

gatccaccca ggatgaacaa agatcaccaa ggggaaagaa aacatttttt atctttacag 360 

aaaacatgtt aagattatat atagatgtat tctttacatt ggatattgta ttagagtcct 420 

ccttacaaga aatgaaatag gtttttagca ctcttagcat tagagttcct agattggtgt 480 

tgatagctac agttttaaaa tgtataacct gaaaatgaag gttaattttg cattgtaaag 540 

agcacatttg atctatgtaa aaagtgtcca tttggtgtat tttttttaaa aaagagaaag 600 
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cactttcata ttaagtagca tgtgtatgaa tttaagattt tcatatttgn tgngtctggt 
attcagtgaa gtaaaattga gcattttaaa agtttggtgg atggcaacca ttaactatta 
aattaaaagc caccttatac tctgctgctt aacttgcttg naaattgcac ctttggnacc 
ctgcacattt tcatattnc 



660 
720 
780 
799 



<210> 768 
<211> 826 
<212> DNA 
<213> Homo sapiens 

<400> 768 

gnnnntnncn ccctttctaa tggcttgttt ctaaatgctt tttcnaatcc ttggtacatg 60 

atcccatcgn ttcgcgctgt gcttgagacc aacctgacgg gtaccttcta catgtgcaaa 120 

gcagtttaca gctcctggat gaaagagcat ggaggatcta tcgtcaatat cattgtccct 180 

actaaagctg gatttccatt agctgtgcat tctggagctg caagacnggg tgtttacaac 240 

ctcaccaaat ctttagcttt ggaatgggcc tgcagtggaa tacggatcaa ttgtgttgcc 300 

cctggagtta tttattccca gactgctgtg gagaactatg gttcctgggg acaaagcttc 360 

tttgaagggt cttttcagaa aatccccgct aaacgaattg gtgttcctga ggaggtctcc 42 0 

tctgtggtct gcttcctact gtctcctgca gcttccttca tcactggaca agtnggtgga 480 

tgtngatggg ggccnggagt ctctatactc actcgtatga ngtccagatc atgacaactg 54 0 

gcccaaggga gcangggacc tttctggtgt caaaaaagat gaaaggagac ctttaaggag 600 

aaagctaagc tcttgagctt gangaaaaca aggggtcctt ccatncccca aatgccttta 660 

catttttgga ggatatgcct nnnggnacnt ttttaaaaaa gcttatnagt tngntatggg 720 

naaaacaatt ttttccttan tttttaaagt ggntaataaa tnaaantcct aatggnaaaa 78 0 

aaactantcc ttggnaanta ttttccaggn cttnantgtn cccncn 826 

<210> 769 
<211> 802 
<212> DNA 
<213> Homo sapiens 

<400> 769 

gnnnttctaa tgctgttcta atgcttgtca atncttgana cgttcatcga ttcgggaagc 60 

caagcctgga gctgcaggtc ccccggcatc tctctctgtc ccggcagccc aggatggcct 12 0 

ggtgccccca cctgctgcag caggagcccc aaggagtgct agctgagggt ggttgctggg 180 

gtggtcctca tggacagtga ggtgtgcccg ggtgcactga gggtggtggg aggggatcac 24 0 

ctgggttcca ggccatcctt gctgagcatc tttgagcctg ccttccggtg ggagcagaaa 300 

aggccagacc ctgctgagtt agaggctgct gggatccact gtttncacac agcgggaagg 360 

ctgctgggaa caggtggcag agaagtgcca tgttngcntt gagccttgca gctcttcagc 420 

tggggactgg tgcttgctga aacccaagag ctgaacagtg aggaggctgt ccaccttgct 480 

tggctcactg ggaccaggaa agcctgtctt tggttaggct cgtgtacttc tgcaggaaaa 540 

aaaaaaagga tgtgtcattg gtcatgatat ttgaaaaggg ggaaggangc cnaaanttgt 600 

tcccatttta ttcaagtatt ggaaaatatt tggcccccct ttggctgaaa ttctttttgc 660 

aanaactaac tgngtggctt gttcncttac cctttttcan gnttaattgg tttnaatttt 720 

ttgcattgaa attaaagacg tttttaaatt tcntttncaa naacaaaggg cttanatncc 780 

ngantcnana nattggnant tc 802 

<210> 770 

<211> 1157 

<212> DNA 

<213> Homo sapiens 



242 



BNSDOCID: <WO_ 9933982A2J_> 



WO 99/33982 



PCT/US98/27610 



<400> 770 

cccttttttt tttttcccnn aaaaaaanat tggggncccn tttttttggg nttttttttc 60 

ccnaaaaaaa aattgggncc ctttttgggg ggnntnaaaa aaannnnnnn nccccccntt 120 

tttttggggn nnnnnaaann tnnnnnncnn nttnnnnnnn nnnnnnnnnn ggnnttnnng 180 

gggnnnannc cnccccccaa tttcccggnn attnttccgg gcccaatttt tgggaccccc 240 

cagggnnnag aataaggccc ggggnttttt tttncnaggg ncccaaaagg gcccttgggc 3 00 

caaaggnaaa tccnttggga aattttggga atttggccct tggnanntcc caataccggn 360 

aaaaatgggg aaangnaaaa aaggnttncn ccaaattggt tggggggggg ttccaaagat 420 

tttcattggg ggtncntggg ctttcaaccc naaggnaang ggtttncttt caaaaaatta 4 80 

cctttaattg ccattaagca attcccaang gttannaaag ggtgtttntt ctcanctatg 540 

cttcganagn gaaaatcaac naatggaaaa tgtgttgtaa ttggtctgca ntctacanga 600 

gaagctagaa cattagaagc tttggaanag ggcggnggag aattgaatga tntttgnttc 660 

aactgccaaa gagtgttgtt gcagtcactc atttgaaaaa ctattttcct gctccagaca 720 

ngaaaaaaac tttatangtt tactaggaat cgatttgaca agcnttcang taacaaacag 780 

ttctnccaag agatatcctt gttnaagaan nattanaata ncnngaaagc ggaaanngtg 840 

aataaatnnc ttcnagaagc ccaaaaannc acngaanaag tatggtgggn cttactggtt 900 

agcacgttct tgacnacaga tggaaattga antctngatt ncctctgatt antgaatgaa 960 

aaggcgacta tcnaanagcc cttnanatac catgagtntt tggancattg attgaccaat 1020 

ttcaanncca tttttangat ngaattntta tnaatgattn attnanaant gannnccttn 1080 

gtttaaatta nnaaanaanc cntcnaaana cnanagggga tttataaaat ctaataanan 1140 

ttttnnncnt ntnaann 1157 

<210> 771 
<211> 760 
<212> DNA 
<213> Homo sapiens 

<400> 771 

ngncctttna tnccttntga anccntttgn aattnctcnn nnngttgatc ccatcgattc 60 

gaattcggca cgaggtggaa gaaaattttt tgctgcttct ggttnccaga aaagggagcc 120 

attttaacag acacatctgt caaaagaaat gacttgtcga ttatttctgg ctaatttttc 180 

tttatagcag agtttctcac acctggcgag ctgtggcatg cttttaaaca gagttcattt 240 

ccagtaccct ccatcagtgc accctgcttt aagaaaatga acttatgcaa atagacatcc 300 

acagcgtcgg taaattaagg ggtgatcacc aagtttcata atattttccc tttataaaag 360 

gatttgttgg ccaggtgcag tggttcatgc ctgtaatccc agcagtttgg gaggctgagg 420 

tgggtggatc acctgaggtc aggagttcga gaccaacctg accaacatgg tgagaccccc 480 

gtctctacta aaaataaaaa aaaaattagc tgggagtggn ggtgggcacc tgtaatccta 540 

gctacttggg aggctgaacc aggagaatct cttgaacctg ggaggcanag gttgcaagtg 600 

agcccgagat cgtgccattg cactccaacc agggcaacaa gagtgaaact ccatcttaaa 660 

aaanaaaaan gaaaactcga gcctctagaa ctatagtgag tcgtattacg tagatccaga 720 

catgataaga tacattgatg aattttggac aaaccccann 760 

<210> 772 
<211> 777 
<212> DNA 
<213> Homo sapiens 

<400> 772 

gaaancccat ttnnnnnttc cncttcnaat cccttggnta ctcgntcttt ntgcaggatc 60 

ccatcgattc gaattcggca cgagctctac taaaaataca aaaattagct gggcgtggtg 12 0 

gcacacacct gtaatcccag ttacttggga ggctgaggca caagaatcgc ttgaacccgg 180 
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gaggcggagg 
gagactctgt 
atattttaaa 
cccagataaa 
ctacttttcc 
taattgtgtg 
tgtctccagc 
gngtttcaaa 
caatgtaata 
tctagaaact 



ttgcagttag 
ctccaaaaac 
aattgttctt 
cagtgtattt 
agtagtttgc 
tcaaattttg 
actatgcatc 
ctggtatttt 
aagttaatta 
atangtgagt 



ccaagatcgc cctgctgcnc tccagcctgg gcaacagagg 
aaaaacaaaa actgttagtg aaggttccct gggacttttg 
atgactagta gataaattca ttgccataat gaggctagct 
tcttcttttt tttttttggt gagtggtcca gagctttaag 
cactttctcc gaggtanttt ggctgctctt tcagtaatgc 
tctacaacag taggcaacag atgaagataa gttggttgaa 
cctattttct atttattggg gtacactcac tttcagtaat 
ttaaaaaaca aatcaatgta aggactgaag ttgaaatanc 
gggttatttt taaaaaaaan aaaaataana actcnagccc 
cgnnttacct tgaatcccag accttgataa gatacnc 



240 
300 
360 
420 
480 
540 
600 
660 
720 
777 



<210> 773 
<211> 782 
<212> DNA 
<213> Homo sapiens 

<400> 773 

gnntnnattc ccctttcnaa tncttggcaa acgctctctn tgttggatcc catcgattcg 6 0 

aattcggcac gagacagtct cgggtttcat attttgctgt ttttgatgga catggaggaa 12 0 

ttcgagcctc aaaatttgct gcacagaatt tgcatcaaaa cttaatcaga aaatttccta 180 

aaggagatgt aatcagtgta ncncccgccg tgaagagatg ccttttggac actttcaagc 24 0 

atactgatga agagttcctt aaacaagctt ccagccagaa gcctgcctgg aaagatgggt 3 00 

ccactgccac gtgtgttctg gctgtagaca acattcttta tattgccaac ctcggagata 360 

gtcgggcaat cttgtgtcgt tataatgagg agagtnaaaa acatgcagcc ttaagcctna 420 

gcaaagagca taatccaact cagtatgaag agcggatgaa gatacagaaa gctggaggaa 480 

acgttaaggg atgggcgtgt tttgggcgtg ctagangtgt cacgctacat tggggacngn 54 0 

cantacaagc gctgcngtgt nacctttgtg ccccgacatc agacgctgcc agctnacccc 600 

caatgacagg ttcattttgn tggccttgtt atnggctctt naaaggnctt tnccccatna 660 

aggaagccng tggaactttc atcttgncct gnantcgang atnaaaaagn atncagaacc 720 

cggggaaggg gaaaatcctn aannctgact tcccggtttc caaaccagtn ttgnaacaaa 780 

nc 782 

<210> 774 
<211> 793 
<212> DNA 
<213> Homo sapiens 

<4O0> 774 

gnannngccn cgnttttgat tccccttntt caaatccttt gnnaatcgcc ctcnctgttt 60 

tgatcccatc cgattcgaat tcggcacgag atggcagttg cttttgaagt atatgatgnn 120 

ttcctccact acaaaaaggg gatctaccac cacactggtc taagagaccc tttcaacccc 180 

tttgagctga ctaatcatgc tgttctgctt gtgggctatc ngcactgact cagcctctgg 240 

gatggattac tggattgtta aaaacagctg gggcaccggc tggggtgaga atggctactt 300 

ccggatccgc agaggaactg atgagtgtgc aattgagagc atagcagtgg cagccacacc 360 

aattcctaaa ttgtagggta tgccttccag tatttcataa tgatctgcat cagttgtaaa 420 

ggggaattgg tatattcaca gactgtagac tttcagcagc aatctcagaa gcttacaaat 480 

agatttccat gaagatattt gtcttcagaa ttaaaactgc ccttaatttt aatatacctt 540 

tcaatcggcc actggccatt tttttctaag tattcaatta agtgggaatt ttctggaaga 600 

tggtcagcta tgaaagtaat agagtnttgc ttaatcattn ggaattcaaa catgctatat 660 

tttttttaaa aatcaatgtg aaaacataga cttattttta aattgntacc aattacaata 720 

aaaataatgg gcaattaatt tttnaaaact ttttaaaata gnatgctcat atttttaaaa 780 
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ataaaanttt tnc 



793 



<210> 775 

<211> 1009 

<212> DNA 

<213> Homo sapiens 

<400> 775 

agcntttttt ngaanttccc ctttnnttna aaaatcccct tttttggcaa aaaattnccc 60 

ccntntntna nngtttttnn gatncccaca tncngnaatn tncgggcncg ggnnactgnc 120 

nannggcncc cttcgggggn ccngtgntaa gncnatnctt gtntntanaa agntggnnnt 180 

nttttncgat ngngactatt gncnacnctc ttccntnttg gcagngngtc tgganggttg 24 0 

nggtngctca tntggntaan ccnatcctgg ngaccaanng gccgnggtgn gcntgcaagc 300 

tttgnccacn tgggaaancc gnnagtggtn gtctcanttg cntgntgggn ncntgncccc 360 

atcttgnctg ctgnancctt ggggagcagg nnctnggtng tggtnctgcc tgcttgctgc 420 

tngttccccg ggcatgcgtn nncannaagg gncatgcntn gggcaanaag gtgcgtggnc 480 

ancgtnngna tnnnnaggac caccntgggt cgngaatcnn tgggttncct gataggaacc 540 

ntnaannnct gcngntttta ttaaatggga nnanangggt ncanttcaaa gccagtnnaa 600 

tgcccttatg gaanggngtg natnacatan cnnnntatgt gtcntanann angaaatcgt 660 

tnnncaaatt tnnacaanaa tntttntaan aaagggtatt tnantntngg tgaaanaaca 720 

angntttaaa gtnaaatgnt tntancanaa ttaantaaac nggtnttnat gattncttac 78 0 

naaantaacn atncnnaagc atttacngct tanangtccn cnngatactn ncanaatatg 840 

gnnnnaattn tannanatng cgataatctn gnananactn tcatnnnnna tngtgtaatc 900 

antanntacn tgatttnnnt naaatgaaaa catntgatnc aagattaatn cattanntat 960 

acnaaaatnt tcanatanta natntacata taatggtttc naataaacn 1009 

<210> 776 
<211> 785 
<212> DNA 
<213> Homo sapiens 

<400> 776 

gnnnnnnntt cccctttcta atcncttgga nntcgctctn tntgnangat cccatngatt 60 

cgaattcggc acgagagaaa cacaggtgtc gtgaaaacta cccctaaaag ccaanatggg 120 

aaaggaaaag actcatatca acattgtcgt cattggacac gtanattcng gcaagtccac 180 

cactactggc catctgatct ataaatnngg tggnntcgac aaaagaacca ttgaaaaatt 240 

tganaaggag gctgctgaga tgggaaaggg ctccttcaag tntgcctggg tcttggataa 300 

actgaaagct gagcgtgaac gtggtatcac cattgatatc tccttgtgga aatttgagac 360 

cagcaagtac tatgtgacta tcattgatgc cccaggacac agagacttta tcaaaaacat 420 

gattacaggg acatctcagg ctgactgtgc tgncctgatt gttgctgctg gtgtnggtga 480 

atttgaagct ggtatctnca agaatgggca nacccnaaag catgcncttn tggcntacac 540 

actgggtgtg aaacaactaa ttgtcggngt taacaaaatg gattcacttg accaccctan 600 

aggccngaag agatattgan gaaattgtta aaggaagtca gcacttncat taagaaaatt 660 

ggcctacaaa tccnnganac aataancatt tgtgccaatt tnngggttgg gaatgggtga 720 

ccaacattgc ttggagccca agtgnttaac aatgccttng gttnaaaggg antggaaaag 780 

ttacc 785 

<210> 777 

<211> 1366 

<212> DNA 

<213> Homo sapiens 
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<400> 777 



ananaanann annnnnnnaa 
tnnanaannn aagnngnttc 
aaagcaagaa agaacagcta 
aggaatannn gnggncaata 
aaggcagcac aagctgngca 
cananatnca atatataagg 
gaagaaataa ntnttgacnt 
ccaactaana ggnctaagga 
atngnccnan anngaaatgc 
ncntnanccc acgcaactca 
gtnttcaaaa tngnacgagn 
ccanaaacac tggntnaggg 
ggctancata gangagatat 
ncgagttaaa acancnagat 
accctnggat gggcntangg 
tganagagcn aaanaanatn 
natatatncn natgananan 
cnnncatgac nnnaaaanga 
aantttgngn aaanctgnga 
tctatnnaaa aaaaagaggg 
gaaataaatn gggnacaata 
caacntgaga nnrmacnang 
gaagntataa agcaactgna 



ggnnaanana nnnnnannnn naanangnaa anananannn 
nanncttttc aaagcttgga aaacgcannc aarmnnnggg 
aagnnngncn cagaganagc ttttangang tntangaaga 
nnnnannnnc ngaaantatc atganacnca aatganggan 
aacagctatn gngacggggg ggccgggaga gnctaaangn 
actgcatgcn aagggatacn aaacaagnan actnntctag 
ancnnacntt cataacgaat agcaccgtac atcgagncaa 
aatggcaaan nacnttaatn nntgagcnaa ggaagggngt 
ntcntaacca anttttaatn gtaacggnat nangatnaan 
aaaanattac attanntaaa aaaganctat ancaaaacta 
aaatgggnaa nantttntnn ccgggaaaat tggnagagat 
naatanatgn ccgcccnaaa aaaccntnac cataggnatn 
ancnatnagg ggatcaanan cntaggnatt ngaaaantaa 
nnggnantac gaganatagc ttggacgngt atcaaatcgg 
aaaaanaaaa aggntngagn gaanttcctc anaggaanng 
aagggccttg gngaaaangg aaaaacagat agngtcatnc 
tggggnaatn taatctacnn tanatnnggg ggaaaaaaat 
gntaatgnna nnatgagaga ttaaacnnat aaaacnagag 
gataaaaaat aaataaattc tntntggaac atntanaccn 
gaaaccatct ngattatgca cananaaatn tnacntngng 
acatatatgn ggatgtacan tnntggncng aaaaactata 
atataaagcn nnaggnagtn tatangggca tcatcaangg 
nnctcatata naaaactgnn cnncaa 



1320 
1366 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



<210> 778 

<211> 775 

<212> DNA 

<213> Homo sapiens 

<400> 778 

gntttnnatn cctctttcta atnncttggc tactcgntct ntctgnanga tcccatcgat 60 

tcgaattcgg cacgagagat tatgagcatg tagaagatga aacttttcct cctttcccac 120 

ctccagcctc tccagagaga caagatggtg aaggaactga gcctgatgaa gagtcaggaa 180 

atggagcacc tgttcctgta cctcccgccg ccgaacagtt aaaagaaata tacccaagct 24 0 

ggatgctcag agattaattt cagagagagg acttccagcc ttaaggcatg tatttgataa 300 

ggcaaaattc aaaggtaaag gtcatgaggc tgaagacttg aagatgctaa tcagacacat 360 

ggagcactgg gcacataggc tattccctaa actgcagttt gaggatttta ttgacagagt 420 

tgaatacctg ggaagtaaaa aggaagttca nacctgttta aaacgaattc gacttgatct 480 

ccctatttta catgaagatt tttgttagca ataatgatga agttgcggag aataatgaac 540 

atgatgtcnc ttctactgaa ttagatccct ttctgacaaa cttatctgaa agtgagatgt 600 

ttgcttcttg agttaagtag aagcctaaca gaaggagcca accacaaaga attgagagaa 660 

atnaacaact gggccttngg aaagaaangc nggccaagct gcttgagtaa tagtcaganc 720 

ctanggaaat gatntggtta atgaattcac cccaggncac acccngttga agagc 775 

<210> 779 
<211> 781 
<212> DNA 

<213> Homo sapiens 
<400> 779 
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gcttttnann 
gatcccatcg 
ttagtagtag 
ggaaaggagg 
gacaaaattg 
cctacattaa 
aaagaagctg 
aactccaaat 
tgctcaaagt 
atgtggttcc 
cttaagaaat 
ctttcataca 
tatgaagatg 
c 



nccctncttt 
attcgaattc 
gttatgaaac 
aaccaccaac 
gtcagccatc 
tagaactctt 
caaggtggat 
gtgcaaaata 
ttacaaggga 
aaacagaatg 
gtcatgagat 
tactggatga 
tactttnaca 



cnaancctct tcaaatcctt ggntatcgtt ctntctgnng 
ggcacgagag acaaagaaaa aggtggcaat catagaagag 
ctctctaaaa agctgccggt tatttaaccc caatgatgat 
cacattactt tgggtccnnt nctacttggc acaacattat 
tattgctttg gagtacataa atactgctat tgaaagtaca 
tctcgtgaaa gctaaaatct ataagcatgc tggaaatatt 
ggatgaggcc caggccttgg acacagcaga cagatttatc 
catgctaaaa gccaacctga ttaaagaagc tgaagaaatg 
aggaacatca gcggtagaga atttgaatga aatgcagtgc 
tgcccaggct tataaagcaa tgaataaatt tggtgaagca 
tgagagacat tttataggaa atcactgatg accagtttga 
aggaagatta cccttagatc atatgtggac ttattnaaac 
gcatncattt tacttcaagg cagcaagaat tgcttttaga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
781 



<210> 780 
<211> 783 
<212> DNA 
<213> Homo sapiens 

<400> 780 

gnntttnnan nnccngnttt ctaatnctnt tcnaatnctt tgnnancgtt ctntatgcan 60 

gacccatcga ttcgggaatc tcctagaaaa gttgtgattt tcgagccata tccttctgtg 12 0 

gtagatccta atgatcctca natgttggcc ttcaacccca ggaaaaagaa ctatgatcga 180 

gtaatgaaag cactggatag cataacttct atcagcnaaa tgacacaagc accatatctg 240 

gaaatcaaga agcaaatgga taaacaggac ccccttgctc atcccttact gcaatgggtt 300 

atatcaagta atagatcaca tattgtgaaa ctgccagtta acaggcaatt gaagtttatg 360 

catactccac atcagttcct tcttctcagc agtccaccag ccaaagaatc caattttaga 420 

gctgctaaaa aactctttgg aagcaccttt gcatttcatg gctcacacat tgaaaactgg 480 

cactccatcc tgaggaatgg tctggttgtt gcttctaata cacgattgca gctccatggt 54 0 

gcaatgtatg gaagtggaat ctatcttagt ccaatgtcaa gcatatcatt tggtactcag 600 

ggatgaacaa gaaacagaag gtgtcagcca aggacgagcc agcttcaagc agtaaaagca 66 0 

gcaaatacat cacagtcacn ggaaaaaagg acagcaatcc caattcctgc caaagccgta 72 0 

acttaaaatg catagncctt atgtgaaagg gatcaccttc atctggacct gcacaaacat 780 

9gc 783 

<210> 781 
<211> 796 
<212> DNA 
<213> Homo sapiens 

<400> 781 

gnnntnegee ttcaatnctn ttcantctnt tcaatctttg aatcntcttt gttgtccatc 60 

gttcaattcg gacgagaccc ttatggcaga tccccacagt ctggggcaga agaggegteg 120 

aggngecaga agtgncggca gcagcagccg cagcagccca aagagaggca agagaaagag 180 

aaagcggccg gtggaggggt nnneggaaga gctggtcccc gtggttgagc tgggtccccg 240 

tggttgaatt ggaagaggee atagccccag gctcagaggc ccagggcgct tgggtctggt 300 

ggggacgegg gggttgcccc caatggtgca getgeagcag tcaccactag ggggtgatgg 360 

agaggaaggg ggccacccca gggccattaa caaccagtac tccttcgtgt gagccaaccc 420 

cacccgctcc acccttttta aaccccccag cccttgctcg tgagattggg cttgggtagg 480 

gacagaagag gcccgaaatc cctcccccat gettnetgae ccttgtttgg ccaaagggca 540 
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tctttgatgg tacaaagcag 
cttcccaagg gcaaggggat 
annccccacc ttnggggcca 
nagggggnaa agggctttcn 
aaaatgggga aancnn 



angcttcggg anaagcttcc gtcacaacac tncaaggtcc 
ttnggcttca tgagctnctt tgaggggctt ttttttggtc 
tttttcccaa ttaacttacc cccaacccca agncanggtt 
anttccatta aagggggttt gtttgttgnt gttttaaacc 



600 
660 
720 
780 
796 



<210> 782 
<211> 886 
<212> DNA 
<213> Homo sapiens 

<400> 782 

cggnnnnnnn gnagcccntt tggnaaangc ctctaaggga aangcctttt tgaaaacnan 60 

angaaaacct ntgggaaaag nccncannna ttttngngaa annggcnnga gcnnanantn 120 

ggacacngtt ntaannnnan nagngnnngt tttnnganan agggnnnnna gnggnannna 180 

ngngnnggag ggaannaagg nanagnannn ggnagnnaag gnnnnaaaga agnagnnang 240 

gaganggnnn gnggnggggc atgangnggg nncagaggca cgaggagccc aagaccatca 300 

cngangagna ngagcagggn accnacatnn acnnggacna cgagaagngg ggccagcgga 360 

agaaggaagg nagnacctng agnaccgnta ccaggaggan cgggaccnac agngacanag 420 

gnccrmnncn anacggannn nanaaacgng aagcaggann nnnanggacc aagggaaggg 480 

nncngnncnn ggaaaganng ggagggaggn ncgaaggcaa aggggggann cgnnannncc 54 0 

aggaagnang gaaggggggn cgggaggnna annganaaga ngaaccnngg gggnncaggg 600 

gggcgagggn agcanaanrm nnccnnagnc aanngaaggg gananaagag ngggaaaann 660 

aannagaaag agggaaaana agnnaaggaa anaaaagang ngnnaannng gganaaaana 720 

nggnganann gnngganaaa ngngnarman aaaanngagg aggncanngg gnaaanaana 780 

nggggagggn nganananag ngaannagac aaggaanagn gaannagngn anagnanngn 840 

gnannaaagg nannggggna anaagnanna nannnnnagn gaagan 886 

<210> 783 
<211> 805 
<212> DNA 
<213> Homo sapiens 

<400> 783 

cnaatncttg ctcttgncct ntttcnaatn cttggcnact cgctttctnt gcggatccct 60 

cnnganncna tcgttcgaat tcggcacgag cacaaggaga agaaagttaa ttaacattga 12 0 

aagatgagaa gacatcttgg aagacttgaa ttgggccttg gaagaagaac agccattcaa 180 

atagatagaa ttgtggtagc aaaggcatac ngntcggaaa gtatagatct ccagggacag 24 0 

tagtcatggg gttggggcac tgttggaatt taaggttgga aggatatatt ggagcccctt 3 00 

gaatacggta acaaggcaca ccttgggcag tggagagtta tcagagtgtt tgaaaaggag 360 

ggttattgag taaataaata gactggtact ttaggaattt taaaatgtgg atcattgtac 420 

tactaataac tatntatttt atatttacta tctactaagt aatttacatg tattttcttg 480 

tactgactgt aaaccttctg ggtgtgggtg ttttaagtgc cattttactg ataaagaaac 540 

tgangcttaa atagntgaaa tanntcaccc tgttagtgag tggcacaatg acaagtcann 600 

atcttanggt tgccnanntc caaaanncat ttaaanttnn agnatnattg annnttttnc 660 

cttatggcnt nnnaaatttg gggagccatt attgaaatcc nttacnacnt angaattgnc 720 

caaaaaaaat actttttggg gaaaactgga tttattaatt atccaaaata atttnantgg 780 

cttgnttggc ttntttccac tntnc 805 



<210> 784 
<211> 776 
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<212> DNA 

<213> Homo sapiens 



<400> 784 



taatgctggt 
atcgattcga 
ataaagcctg 
gcagngtgcc 
tcgtanaaaa 
tattggatga 
gagtatgtgg 
ttaatgggta 
tgtgcccaga 
ttgctatagn 
gccaagtaaa 
aanttnagaa 
catntttcgg 



tactgccctt 
attcggcacg 
caagttacta 
aaaggtgaaa 
acgtctttac 
atcatttgcc 
gtatagaaat 
gatcataaat 
tatgtnttgg 
gnnactattg 
ctgtactgtn 
cngatgtctg 
ngctgaggtt 



caaatccttg caatcccttg gnaancggnc cngcngaccc 
aggttatatt aaattattct ttgntnttct ttgtctttta 
aattgnagtt ncataaattc tgtagtnaag tatcatcttg 
angntgcttn ctctaacaga gaaattctta gngactccag 
aacctgaata agatnganga attgngaaca taccatggcc 
ggnggctana ncagactgta gggtttgtga tggatntatg 
catgaatntn ccatttgnnn ncagagattc aagcntanac 
gacagaatga attcaaaacc tagcacgtgc attgtaaatg 
aaatggcagn tccttggggt catgtntcta ctggcaaaat 
nantgtaatt ataaaattna tcannattat ncaccgattn 
cataggaatt ttgggaattg tgcanaaatt ggatcaattg 
ggcttaaaaa tttatcnggg accacnnatt angaaactna 
cattgnccaa ggccangaag gtntttnccg aaaanc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
776 



<210> 785 
<211> 778 
<212> DNA 
<213> Homo sapiens 

<400> 785 

ttngaaaacn ccttngcttn gttnccccta cngaaaccct tttgaaaacc ntttgcnann 60 

tcctctttnt gnaggatccc atcgattcgt gaaagaggag atcggtgacc tgggctcctt 120 

atgtgcctga atgagtttga gtttcctgtt aactccaaat caacagtatt ttcaacaaga 180 

aatgtgcaat tgaaatcaag tgctgtttaa gtgcagctag gantccacag gaagacactt 240 

gcagtgaaca gagttatgga gcagcaaaaa cacagatcta tttggaaaaa gagaaaacat 300 

atgcgttgta ttttgcttca attataaaat accatcctct caaaggtggt tctaaattac 360 

aaaggacttt gatttctagg tagattctgg gtagagactt cctttcatat tgaggcatta 420 

atgacacctt ttaacctggg aagcaatatg actggagttg tactttgaga agattaatca 480 

ggtttggttg cagaatgaaa gagaagatga agtcaagaga ttggtttaga ggctctagca 540 

gaagcttagt catatttcaa aatgatcaaa tatcaagaaa aattctgagc tgcataactt 600 

gtataaagta attttcagtg atttttttca tggttatgat aaaagaactg gattagcaga 660 

aacttttacc ctgaatcaag atttaatttt tctttgagct catcttaagg atatcggaac 720 

atagggagca aacgatggtg tggctgcctc antgcttgaa ttttaacngt tttgaaan 778 

<210> 786 
<211> 805 
<212> DNA 
<213> Homo sapiens 

<400> 786 

ngccccccct ttcccccctn ttgaaanccc ctttggnana nnccnntttc aaatcncttg 6 0 

naaatccttg gcnactcgtn ctntctgcag gatcccatcg attcgaattc ggacgaggag 120 

aggatcactt gagcttagga gttcaaatcc agcctgagcc aacataacaa gactttgtct 180 

ctaaacaaaa cagttattgt ttaaagaatc tgaaatcttc atctttaatt caggtagccg 240 

tgaatcgagc ccaagtttgt ttgatatcca gttccaagtc tggagagagg catctttatc 3 00 

ttattaaagt atcgagagac aaaatatcag acagcaatga ccaagagtca gcaaattgtg 360 

atgcaaaagg gctatcaaag ggaggctttt tacagagaac taaggaagag aaggaggttg 420 



249 



BNSDOCID: <WO 9933982A2 I > 





WO 99/33982 



PCTAJS98/27610 



ttaaagagac 
gaagtatttt 
ggtccccaac 
gagcagtggg 
cagcgttaga 
tangttgcaa 
tnaaccccca 



ttgagatcag 
gctgagcaaa 
cccggtacca 
tgggcaagcg 
ttctcatagg 
cgcttcctta 
gatgggtact 



aaaaagatca agaacaactt 
catttgaatg cctgtatgta 
gcccgtggcc tgctagggac 
accattccca cctgagcttc 
agtgcaaaac cctattgtaa 
tgagaanttg aatgcctgan 
ngttc 



gaatctcaaa 
ccgtaatcct 
tgggccgcac 
ccctcctgtc 
actgcccatg 
ngaactgtca 



gtatgaattt 
ctatcactgg 
agcaggaggt 
agatcagcag 
ccaagggatc 
ctgncttcca 



480 
54 0 
600 
660 
720 
780 
805 



<210> 787 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<400> 787 

ccttggnnag nngccccctt naaanccttt gaaaaccctt ggcaaangcc ctnncngnnn 60 

gatcccatcg attcgaattc ggacgaggag aggatcactt gagcttagga gttcaaatcc 120 

agcctgagcc aacataacaa gactttgtct ctaaacaaaa cagttattgt ttaaagaatc 180 

tgaaatcttc atctttaatt caggtagcac cgactcgagc ccaagtttgt ttgatatcca 24 0 

gttccaagtc tggagagagg catctntatc ttattaaagt atcgagagac aaaatatcag 300 

acagcaatga ccaagagtca gcaaattgtg atgcaaaagg gctatcaaag ggaggctttt 36 0 

tacagagaac taaggaagag aaggaggttg ttaaagagac ttgagatcag aaaaagatca 42 0 

agaacaactt gaatctcaaa gtatgaattt gaagtatttt gctgagcaaa catttgaatg 480 

cctgtatgta ccgtaatcct ctatcactgg ggtccccaac cccggtacca gcccgtggcc 540 

tgctagggac tgggcccgca cagcaggagg tgagcagngg gtgggcaagc cgaccattcc 600 

cacctgagct tncccctcct gtcagatcag cancagcgtt agattctcat aggagtgcaa 660 

ccctattgta aactgccatg cnagggatct aggttgcacg ctccttatga ggaattgaat 720 

gccctgatga acttgncact gncttccatc acccccagaa ngganctggc taacc 775 

<210> 788 
<211> 774 
<212> DNA 
<213> Homo sapiens 

<400> 788 

gaaacccttt tgtnaanagc cncttcaacc cnttctaatg cttggcaatc gctctntctg 60 

cangacccat cgattcgaat tcggcacnag attatttcca aagcagccta cagtagaaaa 120 

tagtcattat ggcagcagct tctgatgttt ttgtttggta ggttttctga tttcaatata 180 

tagaatcata ttcatagagt atcttctntn ccgcctngca caaagtaccc atttaaaatt 240 

tacatgcaca gttcattgcc acctttctta ggcctatgca tagttaataa ggttataatc 300 

tactcaacat ggaaaatgga gcctatttgc aaacacacaa gtaattaaag taccaattct 360 

ctcttagttt ctttttttat agttggttta ttttgcaatt ataaatgtta aacatcccta 420 

gagatgaaag ttaaaatggt tgatcacaga tcagtagcaa aatacaaatt gacaattcaa 480 

aattataaat aaaactctgt tgaggatgtt taactttgag tctccaaatt taagagctaa 540 

gcttggaaga aacaaattta taggttatat ttccctctta aattaaanaa acaaacttcc 600 

tctggcagta gtttggtgaa ttcctttcat tgnaatgata ccatgattac aggatcaaaa 660 

atgcttaact tacttgccat tctgctcaca tcatcacagg ttgttntttt tttaaagcac 720 

tcnatgtagg cattttaaac cttcnggata accagagtat cttttgagaa annc 774 



<210> 789 
<211> 773 
<212> DNA 
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<213> Homo sapiens 
<400> 789 

ngcccctttg aanccnacng aaatcctttg gcnantcncn ctntctgtng gatcccatcg 60 

attcgaattc ggcacgagag cagatttgng ataaacntnn tgnaggttna accnaagggg 120 

aactnntggt gcaactatgn ngnttggaag atgctgcnta tgtttattga ggattgcann 180 

anananatcc tgaatnctcg ccntttncaa aggcttggat aaagcactca agccagctac 240 

atatgtatag aacggnttaa aatcnatgag gaagcctgga ctaaatatnc catnggactg 300 

gngccnanaa ngctgncgat gaactttgna tctggnnaga agtntaaaga atggcaggat 360 

nantnnctaa ngatgaattt cannacnggn nnnccaccan tcttnaatnc tttaagatca 420 

ttatacgaag ncnangaaaa ggtggcaatc atngaanaat gngnatnatg ttangaaacc 480 

tctctaaaaa gntgacggca ctttaacccc natgatgatg ggaaggaggn accaccaacc 540 

acattanttt ngggtccagt actacttggc acanccttat nacgaaactg gncngtncnt 600 

ctattgcttt gggagtaccn taaaatacng ccntngngag tncacctnca atgaatnnaa 660 

nctctttntc anganagctn nngatccata ngacntgctg ganatnttta aggaancttc 720 

nanggnggan tggattaggc ncaggccntt ggacacancc ntncttnatt tnc 773 

<210> 790 
<211> 953 
<212> DNA 
<213> Homo sapiens 

<400> 790 

aanannnngg gnnnnnnnnn nnnnannnnn nnnnnnnnan nggngnnttn aaanccttnt 60 

aanngncnnt ncngcttnaa accttggnaa ncnccgcccn nttgcannaa angngaannn 120 

atgcttngtg aagcctgann ccaaanctna aggnanggac ctggatcccc ttatatngaa 180 

naancggtnt ggaggaanga gnntgtcngg gaggatgggg cagaaaatga ngnnggcaga 240 

ntggncccgg gggctctgca naccagcctt ggagcctgct cattctgggc ccttgctgcc 300 

aagganccca gcctnaccta gcangaaang anatgaaagc ccttctccca ngaggtaggg 360 

tctaggctgc ccnaacttaa atgcattnag aaanctcnta gatgtggaaa natttttncg 420 

aacctgaaaa tgcagctggt anaatntcaa tgggaagcat aaatncatgt aaaatataat 480 

tnagntngaa tatnanngta aaaatgcact tttnngcggt gtgacngatc ctgggnnccc 540 

annatctgnn attnaagngn tttacnaang gaanggaaag gacctttncc taaactacct 600 

ttttgaacag ancattaaga angnncnttc ttttaagnaa aaaaaaatca aattttgang 660 

aaaantggna ttngaatgtn nagaaaaang gatananaan aaaanccaat nntaannacc 720 

nannctctct gganttcnac tatctccact acntacntnt acntatngcg ntaanatnna 780 

ctnttacntc nnnntantcn cacanacntc ntcnaacnta atnangcncn canaatcctc 840 
tatannatnt antgtnnntc acannncnna cnggntaant ntnnncaacg ccatatcacc 



<210> 791 
<211> 798 
<212> DNA 
<213> Homo sapiens 



900 



nctnnnatcg ncnagntana taacacntat atcgncactc ncacananac tec 953 



<400> 791 

tggnancgen ctntntgttt gatcccatcg attcgaattc ggcacgagga tcattgttaa 60 

ttagtgacat agtaacatct gtagcagctg gttagtaaac ctcatgtggg ggtggggtgg 120 

gggtgtattc cttgggggat ggtttgggcc gaatggggag tggaatattt gcncttcncc 18 0 

tgttttaaat tctaggatag attttaacat ectttgeggt cccagtccaa ggtangctgg 240 
tgtcatagtc ttctcactcc taatccatga ccactgtttt tttcctattt atatcaccag 



300 



251 



BNSDOCID: <WO 9933982A2_1_> 





WO 99/33982 



PCT/US98/27610 



gtagcctact 
atataactga 
aatctgtgta 
taaacctacc 
tttntnaacc 
tnncaaaagg 
aatgntgntc 
atrmtcgatn 
agttctcant 



gagttaatat 
atttcatgag 
tctaatacta 
caaagtnaga 
tcttggcttt 
tgttngatnc 
catgcctnan 
aaatnntann 



ttaagttgtc aatagataag tgtccctgtt 
aagatttatt ccaccanggg tatttcannc 
acccaatctg tttggatgtg gattttaaaa 
tttacctgna tttaaatggc ctttngggtc 
aaaatgcgtt ttattctnga taagatactt 
naattacttt aaaataaaac ctgtaattgn 
tccccttcta gnnntanaaa cntnantaan 
actataaaaa ctncggccct cttananact 



ttgtggcata 
tttgaaacca 
aaatgtttgc 
ttgaaaaagc 
cnaaatancc 
ataatgncat 
aantatatca 
tnatncttga 



360 
420 
480 
540 
600 
660 
720 
780 
798 



ataaccnc 



<210> 792 
<211> 788 
<212> DNA 
<213> Homo sapiens 

<400> 792 

ctnttgttct ttttgcagga tccatcgatt cgaattcggc acgaggcaga gctcacatcc 60 

tgtgcgcagc atcttctgtc ccctcatgtc cttccgccag ggggcctgcg tggtgacggg 12 0 

cagtgaggac atgtgcgtgc acttctttga tgtggagcgg gcggccaagg ctgctgtcaa 18 0 

caagctgcag ggccacagtg cacctgtgct tgatgtcagc ttcaactgcg acgagagcct 240 

actggcctcc agtgacgcca gcggcatggt catcgtctgg aggcgggagc agaagtaggg 3 00 

tcctgtcngc cctgctgctg tcctccatcc cacccctctt actccacctc gtgttgtaaa 360 

taaagtttcg gtggtcatgc tganggccgg ctcccagctc tgccggggac ggacagggca 420 

gaaggcancg ggcaacttca ggaacacggt gaaaaaaaaa aaaaaaaaac tcgagcctct 480 

agaactatag tgagtcgtat tacgtagatc cagacatgat aagatacatt gatgagtttg 54 0 

gacaaaccac aactagaatg cantgaaaaa aatgctttat tttggggaaa atttgggatg 6 00 

ctattgctta atttgnnaac cattntaaac ctgcaaatta aaccaagttt aacaaccaan 660 

caattggcan ttcattttta atggttttna aggttcaagg ggggaaggtt tttgggaagg 72 0 

tttttttaaa attnncgggn ccnnnggngc ccaatgcatt tggggccccg ggnccccaaa 780 

nttttttt 788 

<210> 793 
<211> 806 
<212> DNA 
<213> Homo sapiens 

<400> 793 

gaatcccttt gcttctgtcc tttaagnnat cgttggaaca accatgnctt tttgtaggtg 60 

aagtgttctc tctgcatgca acagtaaaaa ttaatataat atttttncca caaaagaaac 120 

acttaacaga ggcnagtgcc aatttataaa atttatgatc taaaggggga aatcatggat 180 

tataaagtcc ttcagccctt tgggactcta aattggnggg ggattaaaaa gaatttaaaa 240 

taattttnga accgaattta ttttcccctc agtttttgag ggcattaaaa aggcattaaa 300 

tcaagacaaa tcatgtgctt gagaaaaata aaattaatga aaacncagca ctttatgttg 360 

gtttaacntg cancctnctt tggaggtaga atttatttat ttaaaattac tgggtgcatc 420 

angaacccat agggtgtaca aaangttcta ttaaaatctg cnttatagag acaaagaggc 480 

aggcaaatcc atgtnacaaa gggtaaagct tacagtttac aaactgngaa cgccanggtg 540 

taggatataa aaacgcactc ttgagaaaac anatggtcat cagggtgctg aaaacttgca 600 

tggtgctttt caacattagc ctttggtcca caaatttctt gtatttgaca ggatccatag 660 

tgtgccatgg ggcaaganac nattttgccc tctatggtnt tctttaaaaa ttttcanttt 720 

aaaaatacct cttttnncag gaatcctaat tttggcnccg aagcntattn ntggtnccac 780 

atttaccgtt gcccttgccn ttggan 806 
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<210> 794 
<211> 815 
<212> DNA 
<213> Homo sapiens 

<400> 794 

tttcaaatnc cttggcttta nccctttgtt tganntcctt gttcgaattc ggcacgaggc 60 

cttctctggc ctcaccaatt aggtcaaatg ttccttattt tgtgttgtgg ggcatggctc 120 

tncctgtgag gacctgtccc agcttggacc tccgccttcc tgcgactgta ttggtgtctn 180 

tccctctcaa gcctatgagc tcttgcaagg gcagggaccc tgtatgattt tgcctatcgt 24 0 

atgtcctcca gcccccagca cangcgcctg gtgtccagtg agagctcagc aaatactttg 300 

tgagttaaan gacangcggg cttggggtag atggatccgt ctgcctanac agggcangtt 360 

attcccgctt gtgagcaact cttaanagaa acttcatttt ttttcggcgc ctgcncgaac 420 

tttcaaagat gtttcccggc cangaacngt ggctcacacc tgtaatccca gcactttggg 48 0 

aggcttgaag tgggtngatc accttgaggt cangantttn tagaccagnc tggccaacac 54 0 

cggtgaaacc ccgtcctctn ctaaaaatac aaaanttaac tgggtgtngt tggtngggcg 600 

ctttgnantc tcactacttn ggaangctga ngcnatgaan aatttgcttn aaccccngga 660 

iiyycngaagt cceaatLgan gtcnanactt nancccatcc gcgccxtzcan accctggggc 720 

aacangtatc annaacttna acnattaaaa aatnaanana nctcttatcc ctttannaac 780 

nattattgan gntacntatt ntcntagaaa tccct 815 

<210> 795 

<211> 1050 

<212> DNA 

<213> Homo sapiens 



<400> 795 



tttctaatgc ttggctttga 
gcggcgctga cccggncggn 
ttcctgcctc caagacctgg 
cccatgggac ctagactcac 
ccctgttcca gctcccgacc 
ccagcnagcg gncttcagcc 
tttgcagaaa cccananggg 
ctgncctgtn ctgcatcatt 
cntgntcaca atcaancatn 
tacntcntan nnnnganttn 
ncatgnantn ttagtntntg 
anannaancc ntccttnnag 
atataanntn ntntnctnnc 
ggnnactatc ncatantaca 
nantnnnntc tcctnnanat 
natanngnnn anatcnannn 
nannnaanng tatnntngtt 
tacnccnntn arm t a anna t 



gncctctntt taaaatcctt tggcnactac tctgcacgat 
cccacacccg ctctttnctc ttctttgccg cggactccct 
gtgtctacaa ctgtgagccc agcttgnncc aaaggcagtc 
cttncccttg cctctatgaa accttctgct tgggcccanc 
tgcacttcct tgctgggact cangcctcca agctccctgc 
accgtcttcc cctttctttc gggccctgnt tgtnagcanc 
acctngtgcc ccttgcnaag nctgtcgcct tggtgcaaga 
ttncatggtt gncgggggtg tggggntnnn cnngncgnnn 
tatncctnan ntngggtatn acnaatggcc tnaagantgc 
tcangnnntn ttactaacnt ncnatngnnc ntnganatag 
atntanccnc nattgcagcc ncataattat cctacaccac 
aanntgncnt ctatgnaana gnctnnnaat gtggcnncna 
atcntannnn nntcctacgt nannnnncat nnncnctntn 
tcnntnannn cacccatnct nntntnanat ntctcntggg 
ncnctaatna ngatctctca nntacatgan ntanatnacn 
ngtctctcnt atnnnttatn nanngntcan nttacnnnan 
cnaaanntat ntataaancn ncgtnnnttt nnannagatg 
ctangctccg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1050 



<210> 796 
<211> 884 
<212> DNA 



<213> Homo sapiens 
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<400> 796 

ggnnntttng agctcggaaa tcncttnggt nnagcctttc nttgacccca ttgttcgaat 60 

tcggcacgag acggcctggt ggagcagctg tncgaccttt ncctggagtt cctgcacagc 12 0 

caggcacact gcatcggctt cccggacctg gggctgcctg tggtcctgca gntgaagtcg 180 

ttcctccggg agtgcaaggt ggccaactac tgccggcagg tgcagcagct gcttgggaag 24 0 

gttcaggaga actcggcata catntgcaag ccgccgccag agggttncct tnggcgtttc 300 

tgagcagcag gcagtggaag cctggganaa gctgacccgg gaagagggga cacccttgac 360 

cttgtcctac agccacttgg cgcaagcttg cgttgaccgg ggaagatcca acttgggaga 420 

tcaanngggc aaaagaaccg gcttggaaag acctggaact ttcccttgag atcaaaaccg 480 

aaanggaaga atgggcttga canggaangg atgaaggaca gggaagccaa ttttaaaaga 540 

ccctctttga cctgnacaag ctcttgaaaa aggacgacac ccgaggggat tcttcggaga 600 

nnagggatac tgangccccc tgagcacctc ggcatggggg tngggaagac cattgnaaac 660 

aaggaccaag gaaggaaggg ccnaaggaag ggacaagcan ncaaactcgg aanggntgna 720 

atgggncctt ngggantngg aggaacccca naaccccaaa aaggccgggg ggcttgggcc 780 

cccttggggg gaancttnnc aacaaaatnt gggccccaag ggggccccgg aaaggaacga 840 

aaccttggaa gggaatcttg ncaagcttct tanaaaaggg ancg 884 

<210> 797 
<211> 773 
<212> DNA 
<213> Homo sapiens 

<400> 797 

taatgcttgg ctctgtctnt tgttgacccn tngttcgttt gtgcctgagc acccacaatt 60 

tcaggattta gactgtgtgg gcacctcagc tttcctctgg ntgtaaccac tccttggtga 120 

nagagggaac tcctaaccan tcccatttgn caaaggctag gcaatcttca ttctgcttgg 180 

ctttagtcat tcttgtcatt gggctgcaga agaaaaacaa ctttgctggg tgatcccact 24 0 

gccttgattt cacctcggan cgaggctggg ccatgtccaa gtcttatgag gtcaccctga 300 

ctagaaaaaa ttgaactcac ctacaaatag tctgaaagag tggtgtatat caaatacgtg 360 

ggtagtgttg catttcaaat gangctcttc tgggttgaaa tgatatattt ataaaaccag 420 

aatatcaaaa atgggtgatg tataatgtct ctttagtttt tttggtattt ggcctctttt 480 

aaagcctgtc ngatgtatgg gagaaaaaca atgaaccgtg ctttgatttc ctatcaagtc 540 

actcttaaga acatacatat tggttaaagt aactcggtct ttttttatct gattctttga 600 

ggcactatgg gtagcaaaat aaccacttac aaatttaaat gtaatataca cttcttttct 660 

gngtgtcaag tccttatttt tangtgccta attggacatt ttaaaaggtt aaattattng 720 

gttggcatat taatntcaaa aaatctatta attnatttta atgcctggta ccg 773 

<210> 798 
<211> 812 
<212> DNA 
<213> Homo sapiens 

<400> 798 

gtcaatnctn ttcatgaccc tatcgattcg aattcggcac gaggctggag cacgctggag 60 

aggccatccc tgccctggca gcccgcggct gggggagact cctttgcccc attctttgcc 120 

ggtttcctgc cattattggt gtgcaagaca aaacagggct gcacagtggc agagaagtcc 180 

tttgcagtgg ggaccttggc agagactatt cagggcctgg gtgctgcctc agcccagttt 240 

gtgtctcggc tgctccctgt gctgttgagc accgcccaag aggcagaccc cgaggtgcga 300 

agcaatgcca tcttcgggat gggcgtgctg gcagagcatg ggggccaccc tgcccaggaa 360 

cactttccca agctgctggg gctccttttt cccctctggc gcgggagcga catgatcgtg 420 

tccgtgacaa catctgtggg gcacttgccc gctgttgatg gccantccca ccaggaaacc 480 
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agaccccaag tgctggctgc 
atgggtcacc atttgggcgc 
tataaaatgt nggctccccg 
gaccaaccaa aggattccca 
ttnccttgga ccgtttcctt 
accagntttt tggggcttta 



ctactgcatg ccctgncact gaaaggagga acttgnaaga 
ctttttaact ttctgtacca gancaacccc ttgacaaggt 
aaccttnttg cgtattcttg cagncctcaa ttcttggctt 
cccangaaaa cccnaanggg cccnaaactt gttnncttgn 
ggggccaaaa acaggnanaa cccggacang gttttttnaa 
aattggcctt gg 



540 
600 
660 
720 
780 
812 



<210> 799 
<211> 758 
<212> DNA 
<213> Homo sapiens 

<400> 799 

ctaatagctt ttcattcnaa tgcttgtgat ccctcgattc gaattccgtt gctgtcggac 60 

agattgccct agtacccacc cacctatcag ggttatgcaa tggaacatcc tcgcccaagc 120 

tcttggagaa ggcaaagaca actttgtaca gtgccctgtt gaagcactca aatgggaaga 180 

aaggaaatgt ctcatcctgg aagaaatcct ggcctaccag cctgatatat tgtgcctcca 24 0 

agagguggac cactattttg acacctccca gccacccctc agtagactag gctatcaagg 300 

cacgtttttc cccaaaccct ggtcaccttg tctagatgta gaacacaaca atggaccaga 36 0 

tggttgtgcc ttattttttc ttcaaaaccg attcaagcta gtcaacagtg ccaatattag 420 

gctgacagcc atgacattga aaaccaacca ggtggccatt gcacagaccc tggagtgcaa 480 

ggagtcaggc cgacagttct gcatcgctgt tacccatcta aaagcacgca ctggctggga 540 

agcggtttcg atcagcttaa ggcttgtgga ctcttcagaa cctgcaaaac atnacccaag 600 

gagcccaaga ttncccttat tgtgtgtggg gacttcaatg canaccaaca gaanaaggtc 660 

tncaaacact ttgcttcttn cagnctnaac cttganagnc ggcctacaag ntgctgaatg 72 0 

cttgatgggc aatttagaac ccccatacac ctacctgg 758 

<210> 800 
<211> 770 
<212> DNA 
<213> Homo sapiens 

<400> 800 

ttnaaancng cnttggactc cttgcaggat cccatcgatt cgtttaaact gagctccaaa 60 

tgacgttcaa acacccctct cgggtagagt tttcatggtg gaacggttgc gcccaccaaa 120 

cagaagctta tgtttttggc acagaagcct gggccatttt catggacacc tggctggacc 180 

tcggtggaag tgaactccgt aggttgttgc gttcactgca gcacctcaca tgataccgtc 240 

ccctctcatg gaacggagcc tcccccatgc agcccccact caaatggagt tttaaaggct 300 

gggttcaggt tacgggggcg tttctcaccg tctgaatgcg gaggacagag acnagctcca 360 

gggagcgtgg gcgggtgacg gcgctgagat gcgtgatgtc tcggaaacgt cctcgcatcc 420 

ctcancgcgg gcgctgactg ccgcggccct tgcctgtctt caggagcgct ccagcttcgc 480 

ccacacaccc cgggctgatg tcccctcgct ccggcggcct gcagacccca nagtgcctgt 540 

ctcgggaggg ctccccattc acacgaccct gagtttgggt ccaagttagc ttctgtccca 600 

aagtaccngt attcccaaag cgcacccggt aaagganccg ggccggncct tntttgcggg 660 

gccgggggcc ggggccggga actcgtnggg ggttgccngg aanggggtta accgtnccgg 720 

ttnttccgnc cttncgtgca aggcttnccc cgttaagngg cccaaaccnt 770 



<210> 801 
<211> 573 
<212> DNA 



<213> Homo sapiens 
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<400> 801 



ggagccctag agctccacaa 
ccagccacac tccaacacag 
ctgctgctgc tggccaccag 
aggagggacc tctggtgagc 
tagggcgcct atcgccaatg 
cctcaagaac atccagagct 
agtcatgtga gtatcttccc 
gcgcccccac acttctccta 
gagccacact caagaatggt 
aaattgtcca aaagatgcta 



caggactcag agcctctaac cagttccagc actccagact 
caccatgatc ccagccaccc gctcgcttct ctgtgcagcg 
ccgcctggcc acaggtaggt ctcgccactg ccactggggg 
gcagcctccc acagtcccgc tgaccaagag tcttctccca 
agctgcgctg tcagtgcctg cagaccatgg ctgggattca 
tgaaggtgtt gccctcaggg ccccactgca cccaaaccga 
ggttagcttc tgccacttcc agactcgccc aaaccctccc 
gtgggaatgc ctaacatgtg ggtctatcct tctctctgca 
cgcgaggctt gccttgaccc tgaagctccc ttggttcaga 
aagtgagttg tga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
573 



<210> 802 

<211> 1390 

<212> DNA 

<213> Homo sapiens 

<400> 802 

tttttttttt cacaaggaat atcattttat tactgtaatc acaaaatcgt aatttctgta 60 

caggaatgta taagtgaaca ttattcaaag cattggtaat tcacttcata aagagggtaa 12 0 

acatactaca gaacatattg taaagaaaaa atattgtaaa attttctggt cttgcagtgc 180 

actatttagt gcaagtattt aagacacaat agtgttcaat tcagcaaagt attgcagaat 24 0 

gtcatgccac agtccactta attcaaagag ggtcaggaca tgcagcttgt aataaaatgt 300 

cagagtgtgt gtgtgtgtgt gtgtgtgtat ataaaaccac atgtaattca taaaatatat 36 0 

agtggtttat ttagatggtt ttaaatgatt tcactgtgga atccagcata actggaacaa 420 

catccaaggt cttcttaacg gcaacaatct tattgctagg caatggcytt ggcttcaggt 48 0 

argaatgcyt cccagtattt tatcagctgt tgttgtgttt gaactagtga ttctaagtac 54 0 

ttgatgataa cggttttaaa atccttcact cgttctttct caaatcttcc cacttctttt 600 

cgaatcgttt tagatatctg ttcaaaatct ctttcccctt gttgcacttt cgcctcccac 66 0 

tctcttattt catttttagc ttgctgtatt ttatctggtt tgttagcaac catcattttt 720 

gcttcagctt cacgtttttt gagcaaagta atttgagcat cttcccattt ctgccagcac 780 

ttcattcgat ggtcaaacac acctttcact gcagcaataa gacgaatgta gtcactaagt 84 0 

agttctgaaa acatataaaa gtcagmaaaa gcttgttctt gatgtaactg gtctatcttc 900 

tcctcaacct ctgcaagctg agacaaagct ctagataaag cagtatgatc ctcagaatta 960 

cctaacatgg cagcactttt agcaaaggca gctgtgttgg ctgaaagttc ttttctatga 102 0 

cagamcaagg cttcaacact gacatgaagt ttcctaagtt gctgatccag attctcaaat 108 0 

tgctgctgct tttcttcaaa ccatgcatcc gattcattca tcttgattgt cattttgttg 1140 

acagcgtcgg cagccttgtt caccatcctc aatattcctg ctccactcag agcctgtgta 1200 

ttaactgctc taggcagctc tgaactttcc aagaactgcc ttaaatcagg atcctgtagt 1260 

aaagttggat gttttactgt tctttgaaga tacctttcaa gagctgctct ccgtttttct 1320 

acaaactcag tggatgatga gtcttcttta cccactttga ccttggtcat ccctactata 1380 

ctcttttctg 1390 

<210> 803 
<211> 947 
<212> DNA 

<213> Homo sapiens 
<400> 803 

ggaacttctg agtaattggt atcatttcct agtgactcgg ctcttgtact ccaatcccac 60 

agtaaaaccc attgatctgc actactatgc ccagtccagc ctggacmtky Jckcwsgsagg 120 
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ngagagcagc ccagaacccc tggacaacat cttgttggca gcctttgagt ttgacatcca 180 

tcaagtaatc aaagagtgca gcatcgccct gagcaactgg tggtttgtgg cccacctgac 240 

agacctgctg gaccactgca agctcctcca gtcacacaac ctctatttcg gttccaacat 300 

gagagagttc ctcctgctgg agtacgcctc gggactgttt gctcatccca gcctgtggca 360 

gctgggggtc gat tact ttg attactgccc cgagctgggc cgagtctccc tggagctgca 42 0 

cattgagcgg atacctctga acaccgagca gaaagccctg aaggtgctgc ggatctgtga 480 

gcagcggcag atgactgaac aagttcgcag catttgtaag atcttagcca tgaaagccgt 540 

ccgcaacaat cgcctgggtt ctgccctctc ttggagcatc cgtgctaagg atgccgcctt 600 

tgccacgctc gtgtcagaca ggttcctcag ggattactgt gagcgaggct gcttttctga 660 

tttggatctc attgacaacc tggggccagc catgatgctc agtgaccgac tgacattcct 720 

gggaaagtat cgcgagttcc accgtatgta cggggagaag cgttttgccg acgcagcttc 780 

tctccttctg tccttgatga cgtctcggat tgcccctcgg tctttctgga tgactctgct 84 0 

gacagacgcc ttgccccttt tggaacagaa acaggtgatt ttctcagcag aacagactta 900 

tgagttgatg cggtgtctgg aggacttgac gtcaagaaga cctgtgc 947 

<210> 804 
<211> 532 
c2 12 > DNA 

<213> Homo sapiens 
<400> 804 

cctctgccct cccaggttca agccattttc ctgcctcagc ctcccgagnt agactgggac 60 

tgcaggtgcg catcaccacg cctggntaat ttttgtattt tgagtagaga tggggtttca 120 

ccatgttggc caggctggtn tcgaactcct ggccctcaag tgatccaccc acctcagcct 180 

cccaaagtac agggnttata ggcgtgcgcc antntgcccg gccgagaaca atttntcaca 240 

agnttacttt tctagttttg ccaatgcatg gtgaaagtga acccaagcct gggaactgca 300 

ggcctagaca atgcaggrmm ykksttsamm cwsrsmsrmr smsstysmar ywmrsssagm 360 

cttggaaagg agaagtgtga ggcaggtgtg ggtaggacct ctttttagta cctagaaaaa 420 

ggctaagaaa gtggcctgga gatgtttaga aggttaaaac caacgaagaa aaaaatcaat 480 

gacaacctat caggaacgtg attgactctc agaatggaga actggcgaat eg 532 

<210> 805 
<211> 552 
<212> DNA 
<213> Homo sapiens 

<400> 805 

aatgcattnt tgatttttta ttgcagatga tgaaaaagtt ttagatatag acagtgeega 60 

tggttacaca atgttgtaaa tgtatttaat cccacttacg aatgattaaa atgataaatc 120 

ttatgtttat ttcatcacta ccaaaaggct gtgggtgcag gggtgctggt ttctggtcct 180 

agectaagag actggcagtt tccaccttct atctcttggg acagtagctc tgggagcect 240 

gagctgtcat gcaggaagtc cagctaccct gagaccacca tgctggaaag gecacaggga 300 

ggagctctgt ggacagtccc agctgaacct tgccttccag ctgtccctgt caagatgeca 360 

ggsatgtgag taaagecate atggacccty tagaccagac tgcccaccag cagggtaccw 420 

tctggcagcc acatggagca gaagaacege ccagctgagc cacttccaaa ctcttgaccc 480 

actaagtcat gatccacaat gaacccatca tagggatggt tggctttgea gtgtggataa 540 

tgaggatgtc at 552 

<210> 806 
<211> 1646 
<212> DNA 
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<213> Homo sapiens 
<400> 806 

aactagtata tttacaacat cagaaacttc aatatggaga tttgttgttc ctatatcatg 6 0 

atctttagca gcaactacac cataggcact gcacaacctg ggtcctagat caggacgtac 120 

aaaaaatcct ggcaaatgag aggccaaatt gaattttcct tctggattac aatattctgg 180 

caatggcaga ctttttaaaa gatcttcgta tcttgctggc atcatagtct tgaagtcttc 24 0 

tcctgaaggc caatctttca attttaaaac aactgtttct ccactcttgt ttttctgccg 300 

ttttgaaact tcttcaaaac catcccagaa ttccttaaca ttggcatttg aaatgatgct 360 

atctttgcag ttcaggagat cagcttggtg gtctccaaaa tcaagactaa ttgattccgc 420 

cttccatagg ctaatgttca ttttcttatg cacaccagaa accactgcag gctgtccttg 480 

tttccaacat tctttgaaaa gcttccaatt actgctattc ttataatcct taagccataa 54 0 

aatatgcttc tcacagatcc aagaatgtgg tatatcactg tataatttat tattttcatc 600 

cactgcagat attatgcttt cttcaggctc ttctttaagc tctggtttta catttatctt 660 

ggaggtttta cttggtggaa ttttgttttc aacaactgaa gcaattatgt catcaagaat 720 

gttaggcata gtccgtccac ttttgctact tggggctccc attgaatata ctggggcaaa 780 

ggcaatgcca gcatctgtas accccacacg tagctttcca gctgttgtag tcagcaaatc 84 0 

ccgtaaggtt gagccttgtt cattattctg ggacacaaga ggtgatgttc tgccatttgg 900 

agattcagag ttgtcttgtt ctctttcttc tttaatttgg ttttcaaggg taagttcttt 960 

gttttctttt ttttcctctc tggctttttg ctctgcaaga tctgctaacc agtgcagtgg 102 0 

tgactgggat tctggaggag ttaacttgtt atctgtgcct acatcactct ctgggctgct 1080 

gccaccattt ttctcagact tcggaggagt attttgctgc tgagactcag gcatgcacag 1140 

agaaatttta ttactgtgat taagaacatt ctgtaaaact tgagatacac cattcattgt 1200 

aggaaaattt ccaacttgta aattctgttt gttagtacaa tgacaatggg atttaatacc 1260 

atatttttcc ctaagagtgt gcatggcatc tagaagatct gtcaaaacag aaccaggtat 132 0 

aatttgggtt ggcattaaat gtttgtgatc atgaggctgt ccyttcacac acttcatcca 1380 

agcatattag ttctttatcc ctagramyyc tycctttcct ttngccttgt aacaatctaa 144 0 

gcaganccac aawkccacat tttkggcaga cccagtnraw kktaancawk gntgcttcac 1500 

atgcatcaca catctcccgg actcctctca ctgctctttt ccaggcaatt ttggcatcct 156 0 

ttttcaccca ggacaaagct gttttttcag atgttactaa ttgacagaac ttatcaccta 162 0 

ttatatccaa gatatattta gaagtc 1646 

<210> 807 

<211> 1029 

<212> DNA 

<213> Homo sapiens 

<400> 807 

tggggctgtg actgtattta cttcattctt gaatcccgcg tccccgtggc tgggggctga 60 

cacatccctg ggcaccactg tgacttcctg tgggtccctt cccttctgtc cctgactctg 120 

tagacccccc acaggaaggg tcctaggtag ggggaggttc ctcctccctt gaaaccctgg 180 

gccactctgt caaggcaaag ctctgggccc agcaccttgt aaaggctttg atgagaggag 240 

ctctggcttt tgctcagggc ctttggaccc caccctccag cccccaggaa tgcaggcgct 300 

caaagcctgt ggtnaggctg cccgaagcac gtgccgcagt tcttctggag tgggagcagg 360 

gggacagagc tttgggtaga ggagggtcac ctgcaaagct ggaatgccag gggagtgggc 420 

ggtgcctcca gctcctgggg gccagggtgt ctccatacct catgggcctg agcctgggca 480 

ggggtctgga gtgcacatag cccccaggca gggagagggc agtgacagga cagagccact 540 

catctgtccc aaagctgcac caaggggtgt cagcaacccc aacctactga cctactttgg 600 

gaccacaggc ccatctagtg caaatgaggc ccagaaagga gaaatgcttt gctcaacagc 660 

cacagtaggc tgacgtaacc tatgtaatgt agggtcaggg tgggcctgag ggatgancca 720 

ggtggtgggc aggtganaca ccaggtcccc tcctggcctc tgccccaccc agccctctcc 780 
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tgcacggcta ccagaagatg tccgggaaga 
aggtgcagag gagggcaggg gcccggatcc 
atcctgagat gatccaaatc aaacagaata 
tgcactctgc gcacacatat acaagacaca 



acanactagc 
tggcccagaa 
cttgacggaa 
cacacacaca 



cctgagtagg gagtgtggtc 
acactctaaa acagaatccg 
atagtagagt ctgaaaatga 
cgaatccacg cacacgaggc 



840 
900 
960 
1020 
1029 



acaccccac 



<210> 808 
<211> 836 
<212> DNA 

<213> Homo sapiens 
<400> 808 

aaaaccgggt ataacacttt aatatagatt tgtggaactc tggcccttgc agccagaata 6 0 

cacatttata agccataaat aaagcacgca gaaaccataa attaatcgga cccgagacct 12 0 

ggatttcacc gtgtcaagat tgggaatgct ttttttttct ttttcttggt catttacaac 180 

agacccttac attatttttt ttcctgtttt taaacaatag tacaaccctc tggttctgtt 240 

aaaactacat ggttttacac cgagtcactc acaaaatttt tttttttttt taagtaagac 300 

LLcCctytiaa caacatjCaac ggaggagaac aacaacaaca aaaaaatcag aatctgcagg 360 

tgcttgaaga agcaggagtc tacacagtag tggaaaccgg aggctttttt ttaactttat 42 0 

attctttccc gttttcctcc ttatatagaa cgtggggtat ctgtgtggcc ctctgtttgg 48 0 

gacggaacrg ctgcagcggg tgaaggaaga ctgctgtctt gggggtgttg gggtgggggt 54 0 

gttatggatt tcttctccct tgcgtctctg caacaccgtc tccccaaagt ctcgaccccc 600 

acttgctctc tcacttrtcc tcgatccggg gtgccagagt tagccnggcc tgaagccgtc 660 

gtcttcttaa gaggagttca taatgggccg ggagtacacc ccctggtagt aggaggtatc 72 0 

tgcggccagg ggcgaggcgt ccaggcccgt tttgttcgtg accgggccca tggccaagct 780 

gccaggcatg ggggaaccgt agccggggta gtgcatcacc tgttcgtagg ccttga 836 

<210> 809 
<211> 1844 
<212> DNA 

<213> Homo sapiens 
<400> 809 

atcaggtgtt cctcccatgg caggagggaa gaaacccagc aaacggccag cctgggactt 6 0 

aaagggtcag ttatgtgacc taaatgcaga actaaaacgg tgccgtgaga ggactcaaac 120 

gttggaccaa gagaaccagc agcttcagga ccagctcaga gatgcccagc agcaggtcaa 180 

ggccctgggg acagagcgca caacactgga ggggcattta gccaaggtac aggcccaggc 240 

tgagcagggc caacaggagc tgaagaactt gcgtgcttgt gtcctggagc tggaagagcg 300 

gctgagcacg ccaggagggc ttggtgcaag agcttcagaa aaaacaggtg gaattgcagg 360 

aagaacggag gggactgatg tcccaactag aggagaagga gaggaggctg caacatcaga 420 

agcagccctg tcaagcagcc aagcagaagt ggcatctctg cggcaggaga ctgtggccca 480 

ggcagcctta ctgactgagc gggaagaacg tcttcatggg ctagaaatgg agcgccggcg 540 

actgcacaac cagctgcagg aactcaaggg caacatccgt gtattctgcc gggtccgccc 600 

tgtcctgccg ggggagccca ctccaccccc tggcctcctc ctgtttccct ctggccctgg 660 

tgggccctct gatcctccaa cccgccttag cctctcccgg tctgacgagc ggcgtgggac 720 

cctgagtggg gcaccagctc ccccaactcg ccatgatttt tcctttgacc gggtattccc 780 

accaggaagt ggacaggatg aagtgtttga agagattgcc atgcttgtcc agtcagccct 840 

ggatggctat ccagtatgca tctttgccta tggccagaca ggcagtggca agaccttcac 900 

aatggagggt gggcctgggg gagaccccca gttggagggg ctgatccctc gggccctgcg 960 

gcacctcttc tctgtggctc aggagctgag tggtcagggc tggacctaca gctttgtagc 1020 

aagctacgta gagatctaca atgagactgt ccgggacctg ctggccactg gaacccggaa 1080 
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gggtcaaggg 
caccaatgct 
ggcccgccag 
cagtgtattc 
ccccctcagt 
ccccggggag 

gggctggtta 

ctgacctacc 
atttytccay 
gtgaaccagt 
tgtgtgtgtg 
gggtgggagg 
caaataaaga 



ggcgagtgtg 
cgatatgtcc 
aatcgggctg 
cagctacaga 
cttgtggacc 
cgggaacgct 
tcatggccct 
tgctgcagaa 
tggaagagaa 
gtgttattgg 
tgtgtgtgtg 
gttgctggag 
atagtttggt 



agattcgccg tgcagggcca 
ctgtctcctg tgagaaagaa 
tggcccgcac agcccagaat 
tttctgggga gcactccagc 
tggccgggag tgagcgactt 
tcgggaaaca caggccatta 
gagcaacaag gagtcccacg 
ctctctgggt ggtagtgyta 
cgtytccgag tccctcaact 
tactgctcag gccaacagga 
tgtgtgtgtg tgtgtgtcct 
ggtgctttat tgggtggagg 
ttttttttta aataaaggtt 



gggagtgagg agctcactgt 
gtggacgccc tgcttcatct 
gaacggtcat cacgcagcca 
cgaggcctgc agtgtggggc 
gaccccggct tagccctcgg 
acagcagcct gtccacgctg 
tgccttaccg gaacagcaaa 
agatgctcat gtttgtgaac 
ctctacgctt tgcctccaag 
agtgaagacg gatccagatc 
atgtctatgt atcgggtgag 
gcaccatgtc ccagggctat 
ttat 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1844 



<210> 810 
<211> 489 
<212> DNA 
<213> Homo sapiens 

<400> 810 

gccccgctcc atgagcagtg actccccagc tcctcctggc accagtcccc agggctctcc 60 

tgttggtakw wmmwgctwyw ywtsyysswm mywmmycgkg racctcraga tctyyaccct 120 

aaaatarttc tgttgaattt caccctggcr atgtaaaytg akagcttatc ttcacagatg 180 

ysrganaakr gmcmayycmy cwkcaswcct swgncwmays tswrwcwrat ksmtkycykw 240 

kccctattta tgtaaaaata cagggtccct gagccagcct aaggcataag tgacttatcc 300 

ctcctccctg ctcacatata aattgtgtat ttagtgaaag gctgatcaaa grttcaaagr 360 

atgttatttg ttatctacct gtggacccag naggtcccca attccagtta tttccacctt 420 

tccaggaccg ggaccaatgt atatatgtaa ctggattggc tgttctcgtg tgtttgttaa 480 

aatgtgtgg 489 

<210> 811 
<211> 471 
<212> DNA 
<213> Homo sapiens 

<400> 811 

gccctcagcc acccccatcc ctgccccttc tgagactcac agcacccctt tccttcctct 60 

cctcccacct cctccctcag cccctcattc tccttgggaa tctgcagagg gctctgggac 120 

tcactgccgg atgtgaaatc caggcgtcag ctgtttccta ggcaagggca ggaaagtggt 180 

ctccagccct tgctccactc atgcctgggg gnctgggsyy gagtggtatc cctacctggc 240 

ctccccctgg cctctggcct ccagcgctgg gtttgtcgag tgagagagag agaggagctt 300 

gggttgcttc cctgtccccg ccccctctgt ggcattgtcc ctcccactct tatttttcta 360 

ccaattgcta tttttccgaa caatccttgt agagtatgta ccatccaaag gcaggagggc 42 0 

cctcggtggc cggctctggt tggagatggt acagttttat tgtacaggtg c 471 

<210> 812 
<211> 579 
<212> DNA 
<213> Homo sapiens 

<400> 812 
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cccaatgaat caacatactt tattagaccc actaagtgcc aggggagggg cctgtgccta 60 

ngagccaggt tacagggctc acccgtagat tcagtctggt ctctccccat catgcctctc 120 

acttccagtc tgggcttcta ataggagggc cccgacttct tccctcccag tcattctctc 180 

gaatggagaa tctttcctca ttccagggac accaaggctc aggaaggggc ctatccatca 24 0 

tcagtagagc cagacaagct ctcccatcgg acgtcctgtg gctgggccca gaaatgggtg 3 00 

ccgctgcctg tgggactgcc cttccgggaa ggaccagggt gtcttcagtg ctcttggcct 360 

gcacgtggna ggagagtagg cagatgtctg gtgctcttta agctcaaagg catcatggcc 420 

ctctckgnwg sarcrrrsrs akamragkym sssatcncag scagcscwnk arskstsgca 48 0 

nwsmwcatts casmtgcasc mmcmggrrrs mkcsksywcm kmagnsktnm scmtsgsrgy 540 

cagcgcagcg tagggtggca tcctcattgc agatgcagc 579 

<210> 813 
<211> 562 
<212> DNA 
<213> Homo sapiens 



<400> 813 

tgtacagact ttatttatac acatataata tacaccatat atacttattt atagatattc 120 

acacaccagc ccacacactc gcacacactc acacgcacac acccttccag gaggggcgtg 180 

tggctgcctt ggagtcccgc tagscccaaa caagtgatac tgggcttgcc aggcagttgt 240 

gaggttttgt gttttttgct tttaaaaaga aggccatttc ctccagatgt gtcctccctc 3 00 

tccccaagcc ctaaaactcc tccccaaaac actctgaaaa aaattttttt aaaacaagrg 360 

gnttttcctt tgctytggsc caagtagttt ctngganagn tccrggscca tccacaagny 420 

ccgtgcaggt cctagagcac gagagccggg cgtggccttg gtcaggcctg cagctgtgcc 480 

ctctgagggg agaggggagg cgctatagca tcaagggcac ctgccagatg aggagggtgc 540 

tgtccgtctc cccacacggg gc 562 



<210> 814 
<211> 594 
<212> DNA 
<213> Homo sapiens 



<400> 814 

agcctcgcct gggccggcct gtggctccca ttttcctttc agcgggacaa aggggacttg 60 

ttaccaggcc attttctgga tggcctgtga gatctctgcc cctccaagac cckccaaryc 120 

tsmsyckgwc scmswgytsk smsmmwgmmt ycwgcmsygs smrccntgss rryktswrkc 180 

tggcaccagg ctgnagnctc cccaatccca gcccactttg ctgtgtctct ggcgggctgt 240 

cctccttggt gggagctgtc ctgcacactg taggatgctt aaaggtatcc ctkgcctcca 300 

cccaccccta gccagcagct cccagtcaga caacagccag awatgtctcc agactctgcc 360 

cagcctcccc aggtagccac cctcgagaca cgacctcaga gtctctgtgt ctcctagaag 420 

cctgacagag acccccaggg cagtgggtgg gtngcgggct agagaccctt gcctgtntcc 480 

gggaccctgg cgccgctctc ccctcctgtg gatccctccg gactaacagt gttcttagtn 540 

ggcagangct ggggcacccc ttnggccctg ncaggcatng ccattggcgc angc 594 

<210> 815 
<211> 812 
<212> DNA 
<213> Homo sapiens 



<400> 815 
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aaaaccgggt 
cacatttata 
ggatttcacc 
agacccttac 
aaaactacat 
ttccctgcaa 
tgcttgaaga 
attctttccc 
gacggaacrg 
gttatggatt 
acttgctctc 
gtcttcttaa 
tgcggccagg 
gccaggcatg 



ataacacttt 
agccataaat 
gtgtcaagat 
attatttttt 
ggttttacac 
caacagcaat 
agcaggagtc 
gttttcctcc 
ctgcagcggg 
tcttctccct 
tcacttrtcc 
gaggagttca 
ggcgaggcgt 
ggggaaccgt 



aatatagatt tgtggaactc tggcccttgc agccagaata 
aaagcacgca gaaaccataa attaatcgga cccgagacct 
tgggaatgct ttttttttct ttttcttggt catttacaac 
ttcctgtttt taaacaatag tacaaccctc tggttctgtt 
cgagtcactc acaaaatttt tttttttttt taagtaagac 
99^ggagaac aacaacaaca aaaaaatcag aatctgcagg 
tacacagtag tggaaaccgg aggctttttt ttaactttat 
ttatatagaa cgtggggtat ctgtgtggcc ctctgtttgg 
tgaaggaaga ctgctgtctt gggggtgttg gggtgggggt 
tgcgtctctg caacaccgtc tccccaaagt ctcgaccccc 
tcgatccggg gtgccagagt tagccnggcc tgaagccgtc 
taatgggccg ggagtacacc ccctggtagt aggaggtatc 
ccaggcccgt tttgttcgtg accgggccca tggccaagct 
agccggggta gt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
812 



<210> 816 
<211> 999 
<212> DNA 
< 2 1 3 > Homo s ap i ens 

<400> 816 

aagccgcctt ctgagccttt ngcctctgtt gttcctcctg ctgcctgtga gttttcatgt 60 

gtgcatttcg gcttttgatc ttgaagaaga ctttgccnca ctccttgcag gggaagatgg 120 

tggtggggtc tgtctcgccg ctggtggtgc tgtgagaggg tgancncttt accncnacag 180 

tacccactct gggtgccncc aggcttctgc ttcccagags gkrtrrmmmc kmgggccttg 24 0 

ctttgccccc tgnaaaagct gccccctanc catagtatct cccaggcaaa gatgccatgc 300 

tcactgcaaa ctatggaatg aggtcagaac agaatcaaag taacgcttga tgggaaaagt 360 

tggccccaag accccagtac taagagggtc gcctgcgtct cacacacaca cactcacagc 42 0 

aagctttggg ataaaaggca accgggatgg ttgacatctg aatgcaatgg aacatgaagg 48 0 

tcagcttcag tccctactgg gaatgatttc atgagaaggt agcccagatg aaacacctct 54 0 

taaagatagt tgtgccaatt atttattccc ccaacccccc acaaaaacaa atttttttaa 600 

ataaaaggaa aagaaatagg attttttttt ctaaacctga ataaaatgac cacttttaaa 660 

acagrtagtt taaaagggtt acaaaacaag caggcagtcc aggtttcctg attaatgaag 72 0 

atggaggccg tgggttttca ctgtctctaa gtgacacaca gggctttata gttctgcgtc 780 

accctgaagc aagactgaat cttgatcatc caagagaaga tcggtgtcca caacttcagc 84 0 

ctcttccatg acacctccca actgctggac gacgtcgtcg tcgaggatgt ccacatcctt 900 

gtatgggttt gatcagactc agctggtcca ggggcagcag cmcgrcagca ccccacgggc 96 0 

ccgtagtcct ctcaatcgtg gctgccatct cagctgcaa 999 

<210> 817 
<211> 653 
<212> DNA 
<213> Homo sapiens 

<400> 817 

atttttaywt ttaaaacatt ttatgaggga taaaatatag tctttttcta tcagtatgtt 60 

cacacttcct ggcctctcat tgggaagctg taagatgtcc ttcaataaga tcctgaacac 120 

gcgacagaat aatctcatta gagctgctgc aattttctgg accatatggt gggtctatag 180 

tcaggacccc agccacacag agagtccttg gagcgtctcc ctgttcagtg atggggatgt 240 

ggttcttctc aagccatttc tttaggctgt tctttctctc ttccagatcc tctgggctgt 300 

atgctttgca gtctccagac gtgaacaaat gcatcagctt ctccctcact ctatggtccc 360 
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cttcattcat agtttcaaca gtckgcacag catgtcccat aattccggtc acagacatgc 42 0 

tgccatcttc aaggaagttc acaaggacaa tattggcaga gactgggtct gkagttaaam 480 

cccatccttt atactcattc ttctcactgg ctgtcactcg gacctctttg taaatgtaat 540 

cttgccattc taaggggcct ttcttcatcc attcactcat gattgccacc tggctaaatc 600 

agttaaaaaa ctcctcgcaa ctctgggtac tcagcaacca tgctttgagg aag 653 

<210> 818 

<211> 1225 

<212> DNA 

<213> Homo sapiens 

<400> 818 

ggattctttc actgagcaca aagagttgtt ggggctttag catctgactg attttgttac 60 

ggggttgatt ctgaccatag gaagtatgca atgtgaatca ctatttacag agaaacctac 120 

aacagatgct tgatgttgta gaaactggga catatagata ccaagcaaaa ttataagaaa 180 

cctataaggt gttcaatacg cttgtgtttc caaaattcac tgtacatgat cagtttggtg 24 0 

ttcttgtacc acagttttta actgaaggaa ccagttgtaa cagtctcaat tttaactaaa 300 

aCuuC^aa^aa Ci.adadCaaC aau^CaddCC LLLCa^CaLL. Cj" i_ i_ uyy CCaa aCuuC[Ludad oou 

actgtaatgc aagaaccaaa tgcacbgtga tgtggcacca actaattagc aagcatgaat 420 

ttttcaccca agagtgaaaa aaggaaaatc taccatggct tgaagttaaa gagcagaact 480 

cctgactacc attctatgac tgatcaaaag actaatagtt aaaaacctca gcaggccttg 540 

ttcacgatat gcagaaaaaa aagtgctgca gtttagatac ctctggaatt tttccacagt 600 

gtcacaggtt tgtaatactt gaagccctac atttctaaga atatatttct tgctcagttg 660 

tttcakgcaa gcccaagact ttgtaatttt taaagggccc aagatttttt tttttttttt 720 

tttttcaaat aacagaccag cttctttttc ttgcagttac agatgtaatt tcctttttgt 780 

tgtcaaacat aaggtaccaa atatgatgca ataaattgtt ttgaaaaaca gttgtgtgaa 840 

tatttcaact aatctgtgtt gggcttctgt gaaatacaca ggtggaaaca gaggtgcaag 900 

ccagagcaat ngtaatatgc tgtaaggcta gtgcagatgg gagcttttta gaaggggcfca 960 

agtgctggtg tcagggaaat tccataatga agtagaatgc tgctcctgca ttaagatttc 1020 

attgagggca aggctggtgg caggtactat gaatgtaatt cataatttaa aaggaaaact 1080 

aaaaactatt ttgatttggg aaaatgagcc ttaatttgtt aaacctatac actgaggaac 114 0 

tagcctcagg ctttaatatt ctcattggca tttgccaagg tcctgaggcc aaataaggtt 1200 

taagttaaaa caaatccaat tgtnt 1225 

<210> 819 

<211> 1024 

<212> DNA 

<213> Homo sapiens 

<400> 819 

gacaccccag atgcagccac caccagcaga agcgatcagc tgaccccaca agggcacgtg 60 

gctgtggccg tgggctcagg tggcagctat ggagccgagg atgaggtgga ggaggagagt 120 

gacarggccg cgctcctgca ggagcagcag cagcagcagc agccgggatt ctggaccttc 180 

agctactatc agagcttctt tgacgtggac acctcacagg tcctggaccg gatcaaaggc 240 

tcactgctgc cccggcctgg ccacaacttt gtgcggcacc atctgcggaa tcggccggat 300 

ctgtatggcc ccttctggat ctgtgccacg ttggcctttg tcctggccgt cactggcaac 360 

ctgacgmtgg tgctggccca gaggagggac ccctccatcc actacagccc ccagttccac 420 

aaggtgaccg tggcaggcat cagcatctac tgctatrcgt ggctggtgcc cctggccctg 480 

tggggcttcc tgcggtggcg caagggtgtc caggagcgca tggggcccta caccttcctg 540 

gagactgtgt gcatctacgg ctactccctc tttgtcttca tccccatggt ggtcctgtgg 600 

ctcattccct gtgcctntgg ctacagtggc tctttggggg cgctggccct gggcctgtnc 660 
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aaccaccggg ctggtaatca ccctctggcc cgtggtccgt gaggacacca ggctggtggc 
cacagtgctg ctgtccgtgg tcgtgctgcn ccacgccctc ctggccatgg gctgtaagtt 
gtacttcttc cagtcgctgc ctcnggagna cgtggctcct ccaccccaaa tcanatctct 
gccctcaaac atcgcgctgt cccctacctt gccgcagtcc ctggccccct cctaggaagg 
nccgggtccc acaggcaaca cctaagtgga ccaacccctc tgcctgtcct gccccccaga 
cgatgactga aggctccttt gacaccttga gatgantctg ctactttcca gacttttctt 
acaa 



720 
780 
840 
900 
960 
1020 
1024 



<210> 820 
<211> 631 
<212> DNA 
<213> Homo sapiens 

<400> 820 

atttttaywt ttaaaacatt ttatgaggga taaaatatag tctttttcta tcagtatgtt 60 

cacacttcct ggcctctcat tgggaagctg taagatgtcc ttcaataaga tcctgaacac 120 

gcgacagaat aatctcatta gagctgctgc aattttctgg accatatggt gggtctatag 180 

tcaggacccc agccacacag agagtccttg gagcgtctcc ctgttcagtg atggggatgt 24 0 

ggttcttctc aagccatttc tttaggctgt tctttctctc ttccagatcc tctgggctgt 3 00 

atgctttgca gtctccagac gtgaacaaat gcatcagctt ctccctcact ctatggtccc 360 

cttcattcat agtttcaaca gtckgcacag catgtcccat aattccggtc acagacatgc 420 

tgccatcttc aaggaagttc acaaggacaa tattggcaga gactgggtct gkagttaaam 480 

cccatccttt atactcattc ttctcactgg ctgtcactcg gacctctttg taaatgtaat 54 0 

cttgccattc taaggggcct ttcttcatcc attcactcat gattgccacc tggctaaatc 600 

agttaaaaaa ctcctcgcaa ctctgggtac t 631 

<210> 821 
<211> 635 
<212> DNA 
<213> Homo sapiens 

<400> 821 

aggttgctca cctgaaggag cacaggaggg ttttccaggc catgtggctc aggttcctca 60 

agcacaagct gcccctcagc ctctacaaga aggtgctgct gattgtgcat gacgccatcc 12 0 

tgccgcagct ggcgcagccc acgctcatga tcgacttcct cacccgcgcc tssgacctcg 180 

ggggggccct cagcctcttg gccttgaacg ggctgttcat cttgattcac aaacacaacc 24 0 

tggagtaccc tgacttctac cggaagctct acggcctctt ggacccctct gtctttcacg 300 

tcaagtaccg cgcccgcttc ttccacctgg ctgacctctt cctgtcctcc tcccacctcc 360 

ccgcctacct ggtggccgcc ttcgccaagc ggctggcccg cctggccctg acggctcccc 420 

ctgaggccct gctcatggtc ctgcctttca tctgtaacct gctgcgccgg caccctgcct 480 

gccgggtcct cgtgcaccgt ccacacggcc ctcgagttgg aacgccgacc cttacgaacc 540 

ctgggagagg aggacccagc ccagagccgg gctttgggag agttccttgt tggatttttc 600 

agggccttnc agcggcatta ccaacttgag gtttt 635 

<210> 822 
<211> 752 
<212> DNA 

<213> Homo sapiens 
<400> 822 

tgcttttatc ttgaatgtag ccttcaactt tgtgtaattc cttaccaaaa aggccacatg 60 
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gcttaaaatt 
actgttcgat 
atgtatatgc 
ctacactctt 
tgtttaatcc 
tgagtctaaa 
aaggttttcc 
actgcatcct 
ccatgtattc 
tgatcttgga 
ttctggaaaa 
catctccgcc 



caacacacat 
ccacagttta 
ctcattgtgt 
cccccagaat 
ttcagcatga 
tccaaggtca 
agtccgttcc 
ttccacagga 
agttaggcgc 
tagttccatt 
cataggagaa 
accacagtct 



ttgtccccag 
cccacaatga 
tcaaggagct 
ttcagtttgg 
aatgcactga 
tctcgcacta 
cactgagaag 
tcagagagtg 
tgtaggaagc 
ccaatacatt 
ggcaaacttg 
ttttcttcgg 



tcttgtggtt 
tattatgcac 
ccaaggtgat 
gatagataga 
ttggtgggtg 
attcataagt 
caacagcaga 
aactggcctt 
tcagaggctc 
ttctgaggat 
ttctgtgttt 
ac 



tataatttcc 
acagcaggtg 
ggttcctttg 
gccatgaaag 
atggctgacc 
ctctcccagc 
tacagcaaac 
gtggaygagg 
attaaatata 
gctccagata 
cttgatgcca 



acattgccat 
ggatttgtcc 
ggttctgctt 
atgaagtcat 
tgttcggaga 
agtgggttga 
gcagctacac 
taagtatgct 
actggcatcg 
ctgaagtcat 
ttggagagag 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
752 



<210> 823 
<211> 899 
<212> DNA 
<213> Homo sapiens 

<400> 823 

tttgccacag ggtaaacttt tattttagaa tccaatcttt tccccacaca tacacaataa 60 

attaaacaga atccacagta aatgtacatt ttttaacata aaaagtcagt tactgttact 12 0 

tcatgatcac atgaggatcg tcacagctcc gtgtccatta gcacattacc ctccttgtcc 180 

ttaactctta tccgaccgga tctgtacttc gtttcttgat gaccgtttgc atatacggtt 240 

ttaacagtgc catctgggta ttcccgtctc ttgaactggg cagtatgtag ttctctttgg 300 

ccattattaa actctatgag tttgttgcca tcacgttgta ctctgacaat tgtaccatct 360 

gggaaaatgc tttcttcttg tccatcagga aataagtttt taacagtctg gtcaggaaac 420 

gtgatttctt ttcttccatc tgggtaatgt ttttctrttt aaaaagttgt tacagtaaat 480 

attttttgaa ggaagggaag aatttaatga gagggtggag caagtttgta cctatttgtc 540 

cacttgagaa atgtaagact tccagtcctc cgggtatgtc gtgtgagtgg tctgggcagc 600 

tgcatagtag tagatctgta aagacacaca gtcagtctgc cttttctcca gagatggtta 660 

aactatggag gagaacactt ctggaaacat accactcttt ggtctggcat gacctgcttc 720 

acgtcaccat taaagaaagt gacagtgatg gtcttcccat ctgcactcac ttcctttcga 780 

gttccattgg gaaacagtat aacacggcac ccattcttat aaaccttttc cacctttcca 840 

tcaggatgac tgatttctcc ctgtatgtct tggtcttcct cctcctcttt atattcagg 899 

<210> 824 

<211> 1980 

<212> DNA 

<213> Homo sapiens 



<400> 824 



acccgtccgg ggccggccaa tttgcatatt 
ttttgcagcg atttcttaga tgtaaaaatg 
ctcksmwytc ysskwswskm tstgcccrga 
aactccggcc caacctaggc gcgcanccgc 
aagccgccgg ggagccggga accccagcat 
aagagcagca cggtacatta tttcacttgt 
ctcaccctca agaagtgatt ggtatggtaa 
cgagcgtcgc cgcgcggaga agccagctgt 
ggcaattcca aaggtgaaaa cgcagccaac 
cgctgcccct acacgaagca ccagacactg 



tggaatgcgc 
agatctcaat 

Sjctggtttcc 
sacgagatgg 
gattcttggc 
cccgctcccc 
tttaaagcaa 
cccttggcag 
tggctcacgg 
gagctggaga 



cgctataaac 
agcagcgggc 
gtctctcggc 
cgcacttccg 
ctttgttcgc 
ttcataacag 
cgcgcattcg 
tgatttcgga 
caaagagtgg 
aggagtttct 



ccggctgggg 
tgggcacatc 

tcggggctgg 

atcaatgtca 
ttctgatact 
aaaaagggga 
ctaggcctcg 
aatgtgtcaa 
tcggaagaag 
gttcaatatg 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
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taccttactc gagagcggcg cctagagatt agccgcagcg tccacctcac ggacagacaa 660 

gtgaaaatct ggtttcagaa ccgcagatng aaactgaaga aaatgaatcg agaaaaccgg 720 

atccgggagc tcacagccaa ctttaatttt tcctgatgaa tctccaggcg acgcggtttt 780 

ttcacttccc gagcgctggt cccctccctc tgtcttcagg ctctgccagg aactcgcacc 840 

tgtgctggag ccctgttcct ccctcccaca ctcgccatct cctgggccgt tacatctgtg 900 

cagggctggt ttgttctgac tttttgtttc tttgtgtttg cttggtgctg gttwatttgt 960 

tgttttctgg gggaaaaagc catatcatgc taaaattcta tagagataga tattgtccta 1020 

agtgtcaagt cctgactggg ctgggtttgc tgtcttgggg tcccactgct cgaaatggcc 1080 

cctgtcttcg gccgagcntg gtttcctgcc cagcctgggg caaacctagc cgaaggccga 1140 

ggtcccattg ttggcgctga ggtgtctggc ctgaggtcaa tggtgcaaag gagccgccac 1200 

cggcatgtct gcctggagtg ctgtgctgtg tttaatcagg ggatacaggc ccctgggttt 1260 

cttttttctt tcttcctttc ttccttggcc aagagaaggg cttacaggca tggacatgca 1320 

ggttggcaaa cgggcttgac tttggctgat ttaaaaagtg agaaagaaag taaaaaaggt 1380 

taatttttcc ttcctctgta agatatccca gctttaaaaa gaaaaaaaaa aagaattacc 1440 

aagagaaggg gacttctctt ccagtttctg taaggtctta cattgcctga ctaaaatgtt 1500 

tcatttacct ctaaatttcc atatccttct ggctgtagat aaataatgta gttttgttta 1560 

tgcatttgga attagtggat ttttttgtca ttaaaattgt taccactggt aacatgtgac 1620 

aagcacacca caattctccc tatcttgtga agttgttttt ttaaatcgcc ttgaacaaaa 1680 

agtttttttt tttgtttgtt tttgctttct gaaattcaca gaagcctagg aggactgggg 1740 

taagcggaat aaactagaga agggagacat tgtttggatt tccttcctat aaatacaaat 1800 

ctgtataaat gtctattatt atgaagaatt gccaatcttg ttttaagcaa atgcattcta i86 0 

tcgttattat aaatgttagt tctagctcta tttacttcta atcttaaatc agaataaatt 1920 

aatattgtat tgctgctgtg cgtggaaaaa gacgatgttt atgttcttat agaataaaag 1980 

<210> 825 
<211> 333 
<212> DNA 
<213> Homo sapiens 

<400> 825 

tctagatatt gcccaatcgc tgcccacagt gcacatacct ttccaccagt cacatgtgag 60 

agggcagatt ttccaaatgc tcatcaccac ttggcactgt gtggactata attttggcca 120 

gttaggaaat ggcatctcat tgttttcatc ttaatttgcg tcagcctgat tactcattga 180 

aacttgtgag gttgagaaac ttttcttaag cttattggcc attcaagttt cctcctttat 24 0 

gaaatggttg ttcatgtcat ttkctcattt ttatattaga ttgkwtttmt wttttccagc 300 

tgacttgtag gaactctaca tcttatcaat att 333 

<210> 826 
<211> 658 
<212> DNA 
<213> Homo sapiens 

<400> 826 

tttttttttt tttttttttt ttttgaaggc ttcatgaata atttattcca tttgaagttt 60 

tgttttttgt ttttgttttt ttttttttaa aaagtataaa ccttttcatt tcctcaatca 120 

caatttgtac aactcagtgt tatggcattc ggcagcaata gtgtttgttc cttattctct 180 

ttttgtcacg ttaaaaanaa agcaattgga ccatattaaa tgtcactgct aaacaacaac 240 

tttaaaacgc cccttcataa agtgaccaag ctattttgag agggttgatg ctgacatgtc 300 

cagtaatgac gttacaattt gtagcttaaa ctcaataact ttaaggtcca catatccagt 360 

ttactttgaa aactaaagat gttttaaaac ttcatgaata catcaacctg aggagtattt 420 

taggkcccaa atccagtttt taaatttata ctccacnaaa aangaaaata catacataaa 480 
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awtttaaacc mcngttytgg gcccattwaa acaccmaaaa agaccccccn aaaagttaag 54 0 

anttccagct tanttctgga ngggtgggnc aaaatarraw kktwtawwma wwwymytwwt 600 

ccnkmattca gacaaactaa aatcttaaga ggaaacccag accaaaatat cactcatg 658 

<210> 827 
<211> 453 
<212> DNA 
<213> Homo sapiens 

<400> 827 

attatagaga ttaatctcct ttgctcgaag tctntttaaa tattagtcac atctaaaaca 60 

tacttttaca gcaacatcta gactggtgtt tgaccaaaca actgggcatc atagctgaca 120 

cataaaatta accatcacaa ccatgttcta ggcactgttc ctcactgcct gagaagacac 180 

cgttatgttt attagggttt ttgagtttta tccacagctt ttggttatct gcaaccatgt 240 

ctcccaccat taacatagtt cacactgaga tgaggattcc ctatttaaca cttggtccca 300 

acttcttcac agtccatctg gttttgtaga gggaacataa ctggacattc tggtcaggtt 360 

aggtgaggtc aggccttcag gacgctattt tcactgagtt gctttataag gcacattatg 420 

caaaattcca ccagctcctc cgtbcaoccic at-t. 453 

<210> 828 
<211> 657 
<212> DNA 
<213> Homo sapiens 

<400> 828 

aagagaagga cctagagatt gagaggctta agacgaagca aaaagaactg gaggccaaga 60 

tgttggccca gaaggctgag gaaaaggaga accattgtcc cacaatgctc cggccccttt 120 

cacatcgcac agtcacaggg gcaaagcccc tgaaaaaggc tgtggtgatg cccctacagc 180 

taattcagga gcaggcagca tccccaaatg ccgagatcca catcctgaag aataaaggcc 240 

ggaagagaaa gctggagtcc ctggatgccc tagagcctga ggagaaggct gaggactgct 300 

gggagctaca gatcagcccg gagctactgg ctcatgggcg ccaaaaaata ctggatctgc 360 

tgaacgaagg ctcagcccga gatctccgca gtcttcagcg cattggcccg aagaaggccc 420 

agctaatcgt gggctggcgg gagctccacg gccccttcag ccaggtggag gacctggaac 480 

gcgtggaggg cataacgggg aaacagatgg agtccttcct gaaggcaaac atcctgggtc 540 

tcgccgccgg ccagcgctgt ggcgcctcct gaccgtcgtc tcctcactcc gccttttcaa 600 

atttttgtat aaccccgtgt tgtgtaaata cagtttttgc tccggtaaaa aaaaaaa 657 

<210> 829 
<211> 775 
<212> DNA 
<213> Homo sapiens 

<400> 829 

ggtttgagaa aatcaattca aatctgnccc ttctgattgc anctctaacc aggttctgan 60 

cggtgtcaga gacttcccaa tacatttccc ttctagnatg cctcataaat ccactcaaaa 120 

gtaagacacc aaacacacac ctcatttcct gaactgtgac ttccaagctg acatttttct 180 

gagaagcata attattggtt tcattgacaa ttaagttgaa tgtttcatca tcaaaaaata 240 

attcaaaaag ctctactggg ttcaactttt cgctcttgag attcaaaagt ccagaatcca 300 

gtgctgacca gcttggaaaa ttgggtttaa tgtctctttt ggtccaactc ttttctggga 360 

aacatgatac ctttaacttc ttttgagcag gctggatctc aggctcatta tctttttcca 420 

catctgagtc accagagaat gagaggcctt ggagcagttc actcactcga gctttgtctt 480 
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tttttctccc ttttcgggta atgtctcctg cagcatattc cagggatgag atgtgcatgc 
gggcccacaa atcacctggg tgacggtcct tcagagtgtt caaatgtgca actgtccttt 
cagtagcaat aggagtacta caaggaatct ggggtgcaca ctctctgttg ggctttcctg 
aggcttctcc actttgttcc atttcttcag aagtttcttg ctttgcttta aacaatctat 
ctttagttac aatttcttca gctgggtgta gccccagctt tttagaaggc tgagg 



540 
600 
660 
720 
775 



<210> 830 

<211> 413 

<212> DNA 

<213> Homo sapiens 

<400> 830 

agagcctgca agtgacaaag gaagtgaggc agaggcccac atgcccccac cgttcacacc 60 

ctacgtgcct cggattctga acggcttggc ctcggagagg acagcactgt ctccgcagca 120 

gcagcagcag cagacctatg gtgccatcca caacatcagc gggactatcc ctggacagtg 18 0 

cttggcgcak agcsmcasgk gcagtgtggc ntgctgcccc ccaggaggcc tgaggctggg 24 0 

tctcactgct ctgaaaagac acaaccagaa tggcctgggg ctcaggccct tggctgagtg 300 

ggaatgcgtt gggactgccc agctgagcta tcaggtgccc atcttttctg gtmccagcag 360 

tggtgaggag agcacaggca ggcctcgccc ctcccttgct canccagttt ccc 413 

<210> 831 
<211> 876 
<212> DNA 
< 2 1 3 > Homo s api ens 

<400> 831 

gctgacctac agcagaagct gctggatgca gaaagtgaag acagaccaaa acaacgctgg 60 

gagaatattg ccaccattct ggaagccaag tgtgccctga aatatttgat tggagagctg 12 0 

gtctcctcca aaatacaggt cagcaaactt gaaagcagcc tgaaacagag caagaccagc 180 

tgtgnykaca tgcakaagat gctgtttgag gaacgaaatc attttgccga gatagagaca 240 

gagttacaag ctgagctggt cagaatggag caacagcacc aagagaaggt gctgtacctt 300 

ctcagccagc tgcagcaaag ccaaatggca gagaagcagt tagaggaatc agtcagtgaa 360 

aaggaacagc agctgctgag cacactgaag tgtcaggatg aagaacttga gaaaatgcga 420 

gaagtgtgtg agcaaaatca gcagcttctc cgagagaatg aaatcatcaa gcagaaactg 480 

accctcctcc aggtagccag cagacagaaa catcttccta aggataccct tctatctcca 54 0 

gactcttctt ttgaatatgt cccacctaag ccaaaacctt ctcgtgttaa agaaaagttc 600 

ctggagcaaa gcatggacat cgaggatcta aaatattgtt cagagcattc tgtgaatgag 660 

catgaggatg gtgatggtga tgatgatgag ggggatgacg aggaatggaa gccaacaaaa 720 

ttagttaagg tgtccaggga agaacatcca agggtgttcc tgcaagggct ggtgtgggaa 780 

ccangccagt gtggggttcc aggnaagcca aaagtncaga ctggtggtgt tgactgtttg 840 

ctgtgacccc cacaaagttt ncggaaccgc ccacca 876 

<210> 832 
<211> 768 
<212> DNA 
<213> Homo sapiens 

<400> 832 

tagacataga aaacatacag taagaatatg gtattataat cttacggsam mamygysrmm 60 

trnsckkknw rwmktkgwaa agykgyrmyr sgrcsyanra mtanmmmtas ctrgytrrky 120 

mrywtwwmma tycctksccm gggagtttga aatttnatac tatagaaata actttaggtt 180 
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ttaggtagag 
gaaagattgg 
tgtgttggtg 
tcattataac 
gaattagcac 
ggaaacattg 
attactttga 
tcatgcatta 
aatatacata 
acccataaaa 



ttaaagaggt 
attttcctac 
tttattacta 
ccaatttcca 
agttaagttg 
gaaagatggg 
aaattctaaa 
agactgtttt 
tttattttcy 
ccmatataac 



aaagcacatg 
ctttagagat 
gttcagatga 
cttatttgaa 
acactagaaa 
gaaaaaaatc 
cttctttctg 
caaagctttc 
ttaaagcagc 
agcaggggta 



ttgnccacaa 
ctaaaaaaaa 
gtggctgctg 
ctcttaagtc 
ctgcccattt 
ttattttaaa 
tttccaaaac 
ctcacatttt 
tatatcccaa 
ttggagcagc 



ncccaggaaa 
tttaatataa 
aaggggcccc 
ataaatgtat 
ctgtattaca 
atggcttaga 
ttgaaaatat 
taaagtgtga 
cccatgactt 
tttctcaa 



gtatttttaa 
aaaatcattt 
cttgtcattt 
aatgacttat 
ctatcaaata 
aagttttcag 
gtagatggac 
ttttcctttt 
tgggrgatat 



240 
300 
360 
420 
480 
540 
600 
660 
720 
768 



<210> 833 

<211> 1604 

<212> DNA 

<213> Homo sapiens 



<400> 833 



atctttagca 
aaaaaatcct 
caatggcaga 
tcctgaaggc 
ttttgaaact 
atctttgcag 
cttccatagg 
tttccaacat 
aatatgcttc 
cactgcagat 
ggaggtttta 
gttaggcata 
ggcaatgcca 
ccgtaaggtt 
agattcagag 
gttttctttt 
tgactgggat 
gccaccattt 
agaaatttta 
aggaaaattt 
atatttttcc 
aatttgggtt 
agcatantag 
gcaganccac 
atgcatcaca 
ttttcaccca 



gcaactacac 
ggcaaatgag 
ctttttaaaa 
caatctttca 
tcttcaaaac 
ttcaggagat 
ctaatgttca 
tctttgaaaa 
tcacagatcc 
attatgcttt 
cttggtggaa 
gtccgtccac 
gcatctgtas 
gagccttgtt 
ttgtcttgtt 
ttttcctctc 
tctggaggag 
ttctcagact 
ttactgtgat 
ccaacttgta 
ctaagagtgt 
ggcattaaat 
ttctttatcy 
aawkccacat 
catctcccgg 
ggacaaagct 



cataggcact 
aggccaaatt 
gatcttcgta 
attttaaaac 
catcccagaa 
cagcttggtg 
ttttcttatg 
gcttccaatt 
aagaatgtgg 
cttcaggctc 
ttttgttttc 
ttttgctact 
accccacacg 
cattattctg 
ctctttcttc 
tggctttttg 
ttaacttgtt 
tcggaggagt 
taagaacatt 
aattctgttt 
gcatggcatc 
gtttgtgatc 
ctagaactnc 
tttkggcaga 
actcctctca 
gttttttcag 



gcacaacctg 
gaattttcct 
tcttgctggc 
aactgtttct 
ttccttaaca 
gtctccaaaa 
cacaccagaa 
actgctattc 
tatatcactg 
ttctttaagc 
aacaactgaa 

tggggctccc 

tagctttcca 
ggacacaaga 
tttaatttgg 
ctctgcaaga 
atctgtgcct 
attttgctgc 
ctgtaaaact 
gttagtacaa 
tagaagatct 
atgaggctgt 
tycctttcct 
cccagtnraw 
ctgctctttt 
atgttactaa 



«_ «~ w 

ggtcctagat 
tctggattac 
atcatagtct 
ccactcttgt 
ttggcatttg 
tcaagactaa 
accactgcag 
ttataatcct 
tataatttat 
tctggtttta 
gcaattatgt 
attgaatata 
gctgttgtag 
ggtgatgttc 
ttttcaaggg 
tctgctaacc 
acatcactct 
tgagactcag 
tgagatacac 
tgacaatggg 
gtcaaaacag 
ccyttcacac 
ttngccttgt 
kktaancawk 
ccaggcaatt 
ttga 



caggacgtac 
aatattctgg 
tgaagtcttc 
ttttctgccg 
aaatgatgct 
ttgattccgc 
gctgtccttg 
taagccataa 
tattttcatc 
catttatctt 
catcaagaat 
ctggggcaaa 
tcagcaaatc 
tgccatttgg 
taagttcttt 
agtgcagtgg 
ctgggctgct 
gcatgcacag 
cattcattgt 
atttaatacc 
aaccaggtat 
acttcatcca 
aacaatctaa 
gntgcttcac 
ttggcatcct 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1604 



<210> 834 
<211> 617 
<212> DNA 
<213> Homo sapiens 

<400> 834 
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gtccgtcagc tggtagcttt cattcgtaaa agagataaaa gagtgcaggc gcatcgaaaa 60 

cttgtggaag aacagaatgc agagaaggcg aggaaagccg aagagatgag gcggcagcag 120 

aagctaaagc aggccaaact ggtggagcag tacagagaac agagctggat gactatggcc 180 

aatttggaga aagagctcya ssangrtgrm srcrsgkkac gagaaggagt ttggagatgg 240 

atcggatgaa aatgaaatgg aagaacatga actcaaagat gaggaggatg gtaaagacag 300 

tgatgaggcc gaggacgctg agctctatga tgacctttac tgcccagcat gtgacaaatc 360 

gttcaagaca gaaaaggcca tgaagaatca cgagaagtca aagaagcatc gggaaatggt 420 

ggccttgcta aaacaacagc tggaggagga agaagaaaat ttttcaagac ctcaaattga 480 

tgaaaatcca ttagatgaca attctgagga agaaatggaa gatgcaccaa aacaaaagct 54 0 

ttctaaaaaa cagargaaaa agaaacagaa accagcacag gatgtacctg gcaaagattc 600 

atatctgcct gcagctc 617 

<210> 835 
<211> 542 
<212> DNA 
<213> Homo sapiens 



<400> 835 

tttttttttt agaccaacat tctttaatca caaaggcact tgaggacccc tacaaaccca 60 

aagtctctgc caagagtggc cctgcagacg ccccacctgc caccctccat ccacccatcc 120 

atccacacac tcagagttca tcgtgacctg cagagggctc cacactaggc ttgatgaaga 180 

tgccttccat ggccttccac gtattgtgcg tgttggcact gggcatgccg tggacctcat 240 

gctgcccacg gatggggctt ccatactgct cacccgtgac tgacaggaac acagaggtgc 300 

ccacatgctn grarsgcaca gcagcctcac gctcccagnn gctgntccag agcagcgcac 360 

tgtccatann gktccaggtc gtcgccctcg ccgtcttccc caaaggcact cacctcctgg 420 

ttgttggaca gcggcgangg gaagtggtgc gtgtgcaggt tcnttgnccg taagcacatg 480 

cgtgagcctc accgcctgcc cgcagcgcac cgcaagggcc caggcggagc cgacgctcgc 540 



542 



<210> 836 
<211> 542 
<212> DNA 
<213> Homo sapiens 

<400> 836 

tttttttttt agaccaacat tctttaatca caaaggcact tgaggacccc tacaaaccca 60 

aagtctctgc caagagtggc cctgcagacg ccccacctgc caccctccat ccacccatcc 120 

atccacacac tcagagttca tcgtgacctg cagagggctc cacactaggc ttgatgaaga 180 

tgccttccat ggccttccac gtattgtgcg tgttggcact gggcatgccg tggacctcat 240 

gctgcccacg gatggggctt ccatactgct cacccgtgac tgacaggaac acagaggtgc 300 

ccacatgctn grarsgcaca gcagcctcac gctcccagnn gctgntccag agcagcgcac 360 

tgtccatann gktccaggtc gtcgccctcg ccgtcttccc caaaggcact cacctcctgg 420 

ttgttggaca gcggcgangg gaagtggtgc gtgtgcaggt tcnttgnccg taagcacatg 480 

cgtgagcctc accgcctgcc cgcagcgcac cgcaagggcc caggcggagc cgacgctcgc 540 

* c 542 

<210> 837 
<211> 719 
<212> DNA 
<213> Homo sapiens 
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<400> 837 

aaaaggtccc ccttctggga aagaccgagt gaagaaaggt ggatcctaca tgtgccatag 60 

gtcttattgt tacaggtatc gctgtgctgc tcggagccag aacacacctg atagctctgc 120 

ttcgaatctg grnttccgct gtncagccga ccgnctgccc actatngact gacaaccaag 180 

gaaagtcttc cccantccaa ggagcagtcg tgtctgacct acattgggct tttctcagaa 240 

ctttgaacga tcccatgcaa agaattccca ccctgaggtg tttnacatac ctgcccaatg 300 

ncaaaggaac cgccttgtga gaccaaattg ctgacctggg tcagtgcatg tgctttatgg 360 

tgtggtgcat ctttggagat catcgccata ttttactttt gagagtcttt aaagaggaag 420 

gggagtggag ggaaccctga gctaggcttc aggaggcccg cgtcctacgc aggctctgca 480 

caggggttag accccaggtc cgacgcttga ccttcctggg cctcaagtgc cctcccctat 54 0 

caaatgacag ggatggacag catgacctct gggtgtctct ccaactcacc agttctaaaa 600 

agggtatcag attctattgt gacttcataa gtgagaattt atgatagatt attttttagc 660 

tattttttcc atgtgtgaac cttgagtgat actaatcatg taaagtaaga gttccctta 719 

<210> 838 
<211> 579 
<212> DNA 



<400> 838 

aagatatgca gaga tat tec aggatctttt agctttggtg cggtctcctg gagacagtgt 60 

tattcgecaa cagtgtgttg aatatgtcac atccattttg cagtctctct gtgatcagga 120 

cattgeaett atcttaccaa gctcttctga aggttctatt tctganctgg agcagctctc 180 

caattctcta ccaaataaag aattgatgac ctcaatctgt gactgtctgt tggctacget 240 

agctaactct gagagcagtt acaactgttt actgacatgt gtcagaacaa tgatgtttct 300 

tgcagagatg attatggatt atttcattta aaaagttctt taaggaaaaa cagtagtget 360 

ctgcatagtt tactgaaacg agtggtcagc acatttagta aggacacagg agagcttgea 420 

tcttcatttt tagaatttat gagacaaatt cttaactctg acacaattgg gatgctgtgg 480 

gagatgataa tgggtctcat gggaagtagg aggggagctc atacatcacg gacgatgagt 540 

attaatgetg cagagttaaa ccagcttctt ccaaggcaa 579 

<210> 839 

<211> 1172 

<212> DNA 

<213> Homo sapiens 

<400> 839 

aaccaaacct cccaacttag tgaaaacaag gcattcaatg acagaccagc agcagaaact 60 

gentattace tcctaatcat tttatgaaga aatacctata taaaaacaaa cactaaagag 120 

nacaaataga tttaactaaa gtgacaagca taattataaa taaataccag attatcagat 180 

tttaaacaat aatctataac agttttacta tctaaggatt ttcactccaa gaagaaaaaa 240 

tacatagtaa cgccaagctt gcaggacgat gacttaacag atacattttc tcttaatgga 300 

aacttatcta gcttcagtaa tatttctgga tgtagcatca agttgctgtt gcacattttt 360 

aaaagactgg tccagcagtg tttcctcttc atttaaagta ttggcaatag catcattaca 420 

tggattgtcc agaatgtctt cgtttaatcc atttgactcc tccttttgat cctcatcagt 480 

attaacctct tcaaccgtgt gtgccctggg tgtattcatt aacatatcat ttccyagggt 540 

ctgactatta ctcagcagct tkgcctgcct tctttccarg gccagttggt twatttcycy 600 

caattctttg ttgttgctct tctgttaggc ttctacttaa ctcagaagca aacatctcac 660 

tttcagataa gtttgtcaga aagggatcta attcagtaga agtgacatca tgttcattat 720 

tctccgcaac ttcatcatta ttgctaacaa aatcttcatg taaaataggg agatcaagtc 780 

gaattcgttt taaacaggtc tgaacttcct ttttacttcc caggtattca actctgtcaa 840 
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taaaatcctc aaactgcagt 
ttagcatctt caagtcttca 
catgccttaa ggctggaagt 
tatttctttt aactgttctc 
catcaggctc agttccttca 
gaaaagtttc atcttctaca 



ttagggaata gcctatgtgc ccagtgctcc atgtgtctga 
gcctcatgac ctttaccttt gaattttgcc ttatcaaata 
cctctctctg aaattaatct ctgagcatcc agcttgggta 
tttggaggta caggaacagg tgctccattt cctgactctt 
ccatcttgtc tctctggaga ggctggaggt gggaaaggag 
tgctcataat ct 



900 
960 
1020 
1080 
1140 
1172 



<210> 840 

<211> 1145 

<212> DNA 

<213> Homo sapiens 

<400> 840 

cctcctactc ccaaacaaat ctttggggaa aaaaaaacta ccaactgtca gccatgggcc 60 

tgacggcgct aagctctggg gctccgtgca ctgacgtggg gccagccaca gggaggcggg 120 

gatsmrgymg cgngassscm ggakywkgrs cwscwscsrs gymrgkwgca gnrgcrgygg 180 

crhcrsganc mrmagcagcn tgmwgcagct cawgcacctg gagtcctttt aygaaaaamc 240 

yyctcctggg cttatcaagg aagatgagac taagccagaa gattgcatac cagatgtacc 300 

aggcaatgaa cacgccaggg aatttctggc tcatgcacca actaaaggac tttggatgcc 360 

actggggaaa gaagtcaaag ttatgcagtg ttggcgttgc aaacgctatg gtcaccgaac 420 

gggtgacaaa gaatgccctt tctttatcaa aggcaaccaa aagttagagc agttcagagt 480 

ggcacatgaa gatcccatgt atgacatcat acgagacaat aaacgacatg aaaaggacgt 54 0 

aaggatacag cagttaaaac agttactgga ggattctacc tcagatgaag ataggagcag 600 

ctccmgttcc tctgaaggta aagagaaaca caagaaaaag aagaagaaag aaaagcataa 660 

gaaaaggaag aaagaaaaga aaaagaagaa aaaacggaag cacaaatctt ccaagtcaaa 720 

tgagggttct gactcagagt gacaaggatg tgacttgttc aacattctct tctcaaacac 780 

tgaccaagga acagaggaag atgcagtcag agaaagcagc aggatagaga cgccgagaga 84 0 

ggagtatatg tgggtcacag cagtgagctc ccacccgcct tgcagtgaag atgtgacccc 900 

aggagaggga gtgtctcctt ccaggtgcta gctctggaca gcagctgatt ttaggcagga 960 

aagtttcttc atcgttgtcc tccctgctgg tcacatgagt ttacgattcc tttgaagtgt 1020 

ctcccacagg gtggcaggac tgggagaatc tctgaggcgt gtcttccagg ccctcccaca 1080 

gcttgtgccc tccacagtgt ggactcaggt cccatagaca tcaggctgga gtcttctctg 1140 



<210> 841 
<211> 642 
<212> DNA 
<213> Homo sapiens 

<400> 841 

ttttttataa aaataaatat ttattgccat ttgaagcttt atgtacacct ttaaaagcac 60 

atgtacaaat gtgggaaatt acaaaaatca acctaaaacc ctttttctca aagtatacat 120 

aaatgtacat ccaagatcag tggtgctacc atcattagaa taaaaaataa gtctgtctgg 180 

acataaacaa gcaatcattt taagtgtcat tcagatattc tcctttatat ttaaaactcc 240 

aaaaaatact aagaggccca atatatccag aaaattgtgt tttcacttta ccctaactta 300 

tgaatagtgg tatacaaata tatttccatc tttttgtcca gccagcaaat gagagtctgt 360 

acccgaccat ttcacaaaag accaatgttg gtcagagaca gskskgagrr ksgymktasr 420 

stkamysasa akkarstsmm amayrgsrmt tnykcmasra stcamkmtyk ytgsyrcasr 480 

gwkrwctyws rmswmwmmwk msargmmcca tttcagaata ggctttgtga cagactgaag 540 

cttggtaaga atcatcaatg tgcatctttt tcaggagttg accagttttt aaattccaaa 600 

taacaatgtt gttcataata gtagtaccaa gcagagcttc tt 642 



ttgtt 



1145 
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<210> 842 
<211> 452 
<212> DNA 
<213> Homo sapiens 

<400> 842 

acggcctggt ggagcagctg tacgacctca ccctggagta cctgcacagc caggcacact 60 

gcatcggctt cccggagctg gtgctgcctg tggtcctgca gctgaagtcg ttcctccggg 120 

agtgcaaggt ggccaactac tgccggcagg tgcagcagct gcttgggaag gttcaggaga 180 

actcggcata catctrcagc cgccgccaga gggtttcctt cggcgtctct gagcagcagg 240 

cagtggaagc ctgggagaag ctgacccggg aagaggggac acccytgacc ttgtactaca 300 

gccactggcg caantgcgtg accgggagat ccagctggag atcagtggca aagagcggct 360 

ggaagacctg wacttccctg agatcaaacg aaggaagatg gctgacagga aggatgagga 420 

caggwagcaa tttaaagacc tcttttgacc tg 452 

<210> 843 
<211> 805 
< 2 1 2 > DNA 
<213> Homo sapiens 

<400> 843 



atcatgtaaa gtaagagttc cctta 

<210> 844 
<211> 702 
<212> DNA 
<213> Homo sapiens 

<400> 844 
tttttttttt tttttttgca 
atggtctgag gaggacaccc ctcccatggc tttggggagg 
agggcagaaa cacgcggggt gggtgggggc gtggccggag 
cacctggggg tggctcccac ggcaccaggt gggctagggc 
agtgctcctg gacccggtcg gggccggctg gggcccattt 
ggcacagggt ctgagtccca tcttgggctk cagggaccgc 
gacagcgggg gcctttatct gggcccatca ggtggatgag 
caccacctgg gccagggcta ggctatccgg cagggcctcc 
cagaactcaa gccggcatnc aggcagtkgg aacgncccgc 



cagactggtg 


gactgttcat 


60 


ctgggaaaga 


ccgagtgaag 


120 


ggtatcgctg 


tgctgctcgg 


180 


tccgctgtnc 


agccgaccgn 


240 


ntccaaggag 


cagtcgtgtc 


300 


atgcaaagaa 


ttcccaccct 


360 


ttgtgagacc 


aaattgctga 


420 


ggagatcatc 


gccatatttt 


480 


ccctgagcta 


ggcttcagga 


540 


caggtccgac 


gcttgacctt 


600 


ggacagcatg 


acctctgggt 


660 


tattgtgact 


tcataagtga 


720 


gtgaaccttg 


agtgatacta 


780 
805 



taaaaaaatc 


atctgggggc 


60 


acgcaggttc 


caggagtcac 


120 


tggggagggg 


ctgtsccagg 


180 


aacagtatgt 


acaggcgagc 


240 


ctgcggcagg 


ggagctctgg 


300 


gaggscgtcc 


agggaggctg 


360 


aacggacact 


gcaaaccgct 


420 


ccmmctgaat 


cctgcgtgcg 


480 


angctgggct 


tggktgsyck 


540 
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crsgcacgtg acaggtgggg cccgtgtcct gataaacgga caggaacaaa aggaacgcaa 600 
ggtctgggac ccacggctct gggagcagcg ccacccaggc tggctcctag cagagaaatg 660 
ggaatcgcaa atgcattgca atgtgcagtg aagagacgcg ag 702 
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